SECRETED PROTEINS AND POLYNUCLEOTIDES ENCODING THEM

This application is a continuation-in-part of application Ser. No. 08/740,274, filed 
October 25, 1996. 

FIELD OF THE INVENTION

The present invention provides novel polynucleotides and proteins encoded by 
such polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. 

BACKGROUND OF THE INVENTION

Technology aimed at the discovery of protein factors (including, e.g., cytokines,
such as lymphokines, interferons, CSFs and interleukins) has matured rapidly over the
past decade. The now routine hybridization cloning and expression cloning techniques
clone novel polynucleotides "directly" in the sense that they rely on information directly
related to the discovered protein (i.e., partial DNA/amino acid sequence of the protein
in the case of hybridization cloning; activity of the protein in the case of expression
cloning). More recent "indirect" cloning techniques such as signal sequence cloning, which
isolates DNA sequences based on the presence of a now well-recognized secretory leader
sequence motif, as well as various PCR-based or low stringency hybridization cloning
techniques, have advanced the state of the art by making available large numbers of
DNA/amino acid sequences for proteins that are known to have biological activity by
virtue of their secreted nature in the case of leader sequence cloning, or by virtue of the
cell or tissue source in the case of PCR-based techniques. It is to these proteins and the
polynucleotides encoding them that the present invention is directed.

SUMMARY OF THE INVENTION
In one embodiment, the present invention provides a composition comprising an
isolated polynucleotide selected from the group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:1;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:1 from nucleotide 437 to nucleotide 1159;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:1 from nucleotide 515 to nucleotide 1159;

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:1 from nucleotide 539 to nucleotide 1099;

(e) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone AR415_4 deposited under accession
number ATCC 98232;

(f) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone AR415_4 deposited under accession number ATCC 98232;

(g) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone AR415_4 deposited under accession number
ATCC 98232;

(h) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone AR415_4 deposited under accession number ATCC 98232;

(i) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:2;

(j) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:2 having biological activity;

(k) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(h) above;

(l) a polynucleotide which encodes a species homologue of the protein
of (i) or (j) above; and

(m) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(j).

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID
NO:1 from nucleotide 437 to nucleotide 1159; the nucleotide sequence of SEQ ID NO:1
from nucleotide 515 to nucleotide 1159; the nucleotide sequence of SEQ ID NO:1 from
nucleotide 539 to nucleotide 1099; the nucleotide sequence of the full-length protein
coding sequence of clone AR415_4 deposited under accession number ATCC 98232; or the
nucleotide sequence of the mature protein coding sequence of clone AR415_4 deposited
under accession number ATCC 98232. In other preferred embodiments, the
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of
clone AR415_4 deposited under accession number ATCC 98232. In yet other preferred
embodiments, the present invention provides a polynucleotide encoding a protein
comprising the amino acid sequence of SEQ ID NO:2 from amino acid 51 to amino acid
221.
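
By way of illustration only, the nucleotide and amino acid ranges recited above can be handled programmatically. The following Python sketch assumes that the recited positions are 1-based and inclusive, a convention the text does not state explicitly; the function names are hypothetical and form no part of the disclosure.

```python
def region_length(start: int, end: int) -> int:
    """Number of positions in a region given 1-based, inclusive coordinates (assumed convention)."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


def complete_codons(start: int, end: int) -> int:
    """Number of complete three-nucleotide codons that fit in the region."""
    return region_length(start, end) // 3


def subsequence(seq: str, start: int, end: int) -> str:
    """Extract a region from a sequence string using 1-based, inclusive coordinates."""
    if end > len(seq):
        raise ValueError("end lies beyond the sequence")
    region_length(start, end)  # reuses the coordinate validation above
    return seq[start - 1:end]


if __name__ == "__main__":
    # Ranges taken from the text: SEQ ID NO:1, nucleotides 437 to 1159.
    print(region_length(437, 1159), complete_codons(437, 1159))
    # SEQ ID NO:2, amino acids 51 to 221.
    print(region_length(51, 221))
```

Under the stated assumption, the region from nucleotide 437 to nucleotide 1159 spans 723 nucleotides (241 complete codons), and the fragment from amino acid 51 to amino acid 221 spans 171 residues.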

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ
ID NO:1.

In other embodiments, the present invention provides a composition comprising
a protein, wherein said protein comprises an amino acid sequence selected from the group
consisting of:

(a) the amino acid sequence of SEQ ID NO:2;

(b) the amino acid sequence of SEQ ID NO:2 from amino acid 51 to
amino acid 221;

(c) fragments of the amino acid sequence of SEQ ID NO:2; and

(d) the amino acid sequence encoded by the cDNA insert of clone
AR415_4 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins. Preferably such
protein comprises the amino acid sequence of SEQ ID NO:2 or the amino acid sequence
of SEQ ID NO:2 from amino acid 51 to amino acid 221.

In one embodiment, the present invention provides a composition comprising an
isolated polynucleotide selected from the group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:3;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:3 from nucleotide 59 to nucleotide 376;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:3 from nucleotide 179 to nucleotide 376;

(d) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone AS63_29 deposited under accession
number ATCC 98232;

(e) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone AS63_29 deposited under accession number ATCC 98232;

(f) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone AS63_29 deposited under accession number
ATCC 98232;

(g) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone AS63_29 deposited under accession number ATCC 98232;

(h) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:4;

(i) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:4 having biological activity;

(j) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(g) above;

(k) a polynucleotide which encodes a species homologue of the protein
of (h) or (i) above; and

(l) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(i).

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID
NO:3 from nucleotide 59 to nucleotide 376; the nucleotide sequence of SEQ ID NO:3 from
nucleotide 179 to nucleotide 376; the nucleotide sequence of the full-length protein coding
sequence of clone AS63_29 deposited under accession number ATCC 98232; or the
nucleotide sequence of the mature protein coding sequence of clone AS63_29 deposited
under accession number ATCC 98232. In other preferred embodiments, the
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of
clone AS63_29 deposited under accession number ATCC 98232. In yet other preferred
embodiments, the present invention provides a polynucleotide encoding a protein
comprising the amino acid sequence of SEQ ID NO:4 from amino acid 1 to amino acid 91.

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ
ID NO:3 or SEQ ID NO:5.

In other embodiments, the present invention provides a composition comprising
a protein, wherein said protein comprises an amino acid sequence selected from the group
consisting of:

(a) the amino acid sequence of SEQ ID NO:4;

(b) the amino acid sequence of SEQ ID NO:4 from amino acid 1 to
amino acid 91;

(c) fragments of the amino acid sequence of SEQ ID NO:4; and

(d) the amino acid sequence encoded by the cDNA insert of clone
AS63_29 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins. Preferably such
protein comprises the amino acid sequence of SEQ ID NO:4 or the amino acid sequence
of SEQ ID NO:4 from amino acid 1 to amino acid 91.

In one embodiment, the present invention provides a composition comprising an
isolated polynucleotide selected from the group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:6;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:6 from nucleotide 198 to nucleotide 2039;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:6 from nucleotide 490 to nucleotide 809;

(d) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone AY304_14 deposited under accession
number ATCC xxxxx;

(e) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone AY304_14 deposited under accession number ATCC xxxxx;

(f) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone AY304_14 deposited under accession number
ATCC xxxxx;

(g) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone AY304_14 deposited under accession number ATCC xxxxx;

(h) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:7;

(i) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:7 having biological activity;

(j) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(g) above;

(k) a polynucleotide which encodes a species homologue of the protein
of (h) or (i) above; and

(l) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(i).

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID
NO:6 from nucleotide 198 to nucleotide 2039; the nucleotide sequence of SEQ ID NO:6
from nucleotide 490 to nucleotide 809; the nucleotide sequence of the full-length protein
coding sequence of clone AY304_14 deposited under accession number ATCC xxxxx; or
the nucleotide sequence of the mature protein coding sequence of clone AY304_14
deposited under accession number ATCC xxxxx. In other preferred embodiments, the
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of
clone AY304_14 deposited under accession number ATCC xxxxx. In yet other preferred
embodiments, the present invention provides a polynucleotide encoding a protein
comprising the amino acid sequence of SEQ ID NO:7 from amino acid 126 to amino acid
204 or a polynucleotide encoding a protein comprising the amino acid sequence of SEQ
ID NO:7 from amino acid 106 to amino acid 204.

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ
ID NO:6.

In other embodiments, the present invention provides a composition comprising
a protein, wherein said protein comprises an amino acid sequence selected from the group
consisting of:

(a) the amino acid sequence of SEQ ID NO:7;

(b) the amino acid sequence of SEQ ID NO:7 from amino acid 126 to
amino acid 204;

(c) the amino acid sequence of SEQ ID NO:7 from amino acid 106 to
amino acid 204;

(d) fragments of the amino acid sequence of SEQ ID NO:7; and

(e) the amino acid sequence encoded by the cDNA insert of clone
AY304_14 deposited under accession number ATCC xxxxx;

the protein being substantially free from other mammalian proteins. Preferably such
protein comprises the amino acid sequence of SEQ ID NO:7; the amino acid sequence of
SEQ ID NO:7 from amino acid 126 to amino acid 204; or the amino acid sequence of SEQ
ID NO:7 from amino acid 106 to amino acid 204.

In one embodiment, the present invention provides a composition comprising an
isolated polynucleotide selected from the group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:8;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:8 from nucleotide 102 to nucleotide 2027;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:8 from nucleotide 1902 to nucleotide 2027;

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:8 from nucleotide 1 to nucleotide 431;

(e) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone BG160_1 deposited under accession
number ATCC 98232;

(f) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone BG160_1 deposited under accession number ATCC 98232;

(g) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone BG160_1 deposited under accession number
ATCC 98232;

(h) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone BG160_1 deposited under accession number ATCC 98232;

(i) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:9;

(j) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:9 having biological activity;

(k) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(h) above;

(l) a polynucleotide which encodes a species homologue of the protein
of (i) or (j) above; and

(m) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(j).

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID
NO:8 from nucleotide 102 to nucleotide 2027; the nucleotide sequence of SEQ ID NO:8
from nucleotide 1902 to nucleotide 2027; the nucleotide sequence of SEQ ID NO:8 from
nucleotide 1 to nucleotide 431; the nucleotide sequence of the full-length protein coding
sequence of clone BG160_1 deposited under accession number ATCC 98232; or the
nucleotide sequence of the mature protein coding sequence of clone BG160_1 deposited
under accession number ATCC 98232. In other preferred embodiments, the
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of
clone BG160_1 deposited under accession number ATCC 98232. In yet other preferred
embodiments, the present invention provides a polynucleotide encoding a protein
comprising the amino acid sequence of SEQ ID NO:9 from amino acid 1 to amino acid 110.

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ
ID NO:8.

In other embodiments, the present invention provides a composition comprising
a protein, wherein said protein comprises an amino acid sequence selected from the group
consisting of:

(a) the amino acid sequence of SEQ ID NO:9;

(b) the amino acid sequence of SEQ ID NO:9 from amino acid 1 to
amino acid 110;

(c) fragments of the amino acid sequence of SEQ ID NO:9; and

(d) the amino acid sequence encoded by the cDNA insert of clone
BG160_1 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins. Preferably such
protein comprises the amino acid sequence of SEQ ID NO:9 or the amino acid sequence
of SEQ ID NO:9 from amino acid 1 to amino acid 110.

In one embodiment, the present invention provides a composition comprising an
isolated polynucleotide selected from the group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:11;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:11 from nucleotide 566 to nucleotide 631;

(c) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone BO432_4 deposited under accession
number ATCC 98232;

(d) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone BO432_4 deposited under accession number ATCC 98232;

(e) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone BO432_4 deposited under accession number
ATCC 98232;

(f) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone BO432_4 deposited under accession number ATCC 98232;

(g) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:12;

(h) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:12 having biological activity;

(i) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(f) above;

(j) a polynucleotide which encodes a species homologue of the protein
of (g) or (h) above; and

(k) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(h).

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID
NO:11 from nucleotide 566 to nucleotide 631; the nucleotide sequence of the full-length
protein coding sequence of clone BO432_4 deposited under accession number ATCC
98232; or the nucleotide sequence of the mature protein coding sequence of clone BO432_4
deposited under accession number ATCC 98232. In other preferred embodiments, the
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of
clone BO432_4 deposited under accession number ATCC 98232.

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ
ID NO:11, SEQ ID NO:10 or SEQ ID NO:13.

In other embodiments, the present invention provides a composition comprising
a protein, wherein said protein comprises an amino acid sequence selected from the group
consisting of:

(a) the amino acid sequence of SEQ ID NO:12;

(b) fragments of the amino acid sequence of SEQ ID NO:12; and

(c) the amino acid sequence encoded by the cDNA insert of clone
BO432_4 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins. Preferably such
protein comprises the amino acid sequence of SEQ ID NO:12.

In one embodiment, the present invention provides a composition comprising an
isolated polynucleotide selected from the group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:14;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:14 from nucleotide 45 to nucleotide 428;

(c) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone BO538_2 deposited under accession
number ATCC 98232;

(d) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone BO538_2 deposited under accession number ATCC 98232;

(e) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone BO538_2 deposited under accession number
ATCC 98232;

(f) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone BO538_2 deposited under accession number ATCC 98232;

(g) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:15;

(h) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:15 having biological activity;

(i) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(f) above;

(j) a polynucleotide which encodes a species homologue of the protein
of (g) or (h) above; and

(k) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(h).

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID
NO:14 from nucleotide 45 to nucleotide 428; the nucleotide sequence of the full-length
protein coding sequence of clone BO538_2 deposited under accession number ATCC
98232; or the nucleotide sequence of the mature protein coding sequence of clone BO538_2
deposited under accession number ATCC 98232. In other preferred embodiments, the
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of
clone BO538_2 deposited under accession number ATCC 98232. In yet other preferred
embodiments, the present invention provides a polynucleotide encoding a protein
comprising the amino acid sequence of SEQ ID NO:15 from amino acid 52 to amino acid
128.

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ
ID NO:14 or SEQ ID NO:16.

In other embodiments, the present invention provides a composition comprising
a protein, wherein said protein comprises an amino acid sequence selected from the group
consisting of:

(a) the amino acid sequence of SEQ ID NO:15;

(b) the amino acid sequence of SEQ ID NO:15 from amino acid 52 to
amino acid 128;

(c) fragments of the amino acid sequence of SEQ ID NO:15; and

(d) the amino acid sequence encoded by the cDNA insert of clone
BO538_2 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins. Preferably such
protein comprises the amino acid sequence of SEQ ID NO:15 or the amino acid sequence
of SEQ ID NO:15 from amino acid 52 to amino acid 128.

In one embodiment, the present invention provides a composition comprising an
isolated polynucleotide selected from the group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:17;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:17 from nucleotide 144 to nucleotide 566;

(c) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone BR595_4 deposited under accession
number ATCC 98232;

(d) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone BR595_4 deposited under accession number ATCC 98232;

(e) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone BR595_4 deposited under accession number
ATCC 98232;

(f) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone BR595_4 deposited under accession number ATCC 98232;

(g) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:18;

(h) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:18 having biological activity;

(i) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(f) above;

(j) a polynucleotide which encodes a species homologue of the protein
of (g) or (h) above; and

(k) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(h).

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID
NO:17 from nucleotide 144 to nucleotide 566; the nucleotide sequence of the full-length
protein coding sequence of clone BR595_4 deposited under accession number ATCC
98232; or the nucleotide sequence of the mature protein coding sequence of clone BR595_4
deposited under accession number ATCC 98232. In other preferred embodiments, the
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of
clone BR595_4 deposited under accession number ATCC 98232. In yet other preferred
embodiments, the present invention provides a polynucleotide encoding a protein
comprising the amino acid sequence of SEQ ID NO:18 from amino acid 39 to amino acid
141.

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:17 or SEQ ID NO:19. 

In other embodiments, the present invention provides a composition comprising
a protein, wherein said protein comprises an amino acid sequence selected from the group
consisting of:

(a) the amino acid sequence of SEQ ID NO:18;

(b) the amino acid sequence of SEQ ID NO:18 from amino acid 39 to
amino acid 141;

(c) fragments of the amino acid sequence of SEQ ID NO:18; and

(d) the amino acid sequence encoded by the cDNA insert of clone
BR595_4 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins. Preferably such
protein comprises the amino acid sequence of SEQ ID NO:18 or the amino acid sequence
of SEQ ID NO:18 from amino acid 39 to amino acid 141.

In one embodiment, the present invention provides a composition comprising an
isolated polynucleotide selected from the group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:20;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:20 from nucleotide 232 to nucleotide 1041;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:20 from nucleotide 460 to nucleotide 1041;

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:20 from nucleotide 590 to nucleotide 1163;

(e) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone CI490_2 deposited under accession
number ATCC 98232;

(f) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone CI490_2 deposited under accession number ATCC 98232;

(g) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone CI490_2 deposited under accession number
ATCC 98232;

(h) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone CI490_2 deposited under accession number ATCC 98232;

(i) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:21;

(j) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:21 having biological activity;

(k) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(h) above;

(l) a polynucleotide which encodes a species homologue of the protein
of (i) or (j) above; and

(m) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(j).

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID
NO:20 from nucleotide 232 to nucleotide 1041; the nucleotide sequence of SEQ ID NO:20
from nucleotide 460 to nucleotide 1041; the nucleotide sequence of SEQ ID NO:20 from
nucleotide 590 to nucleotide 1163; the nucleotide sequence of the full-length protein
coding sequence of clone CI490_2 deposited under accession number ATCC 98232; or the
nucleotide sequence of the mature protein coding sequence of clone CI490_2 deposited
under accession number ATCC 98232. In other preferred embodiments, the
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of
clone CI490_2 deposited under accession number ATCC 98232. In yet other preferred
embodiments, the present invention provides a polynucleotide encoding a protein
comprising the amino acid sequence of SEQ ID NO:21 from amino acid 133 to amino acid
270.

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ
ID NO:20.

In other embodiments, the present invention provides a composition comprising
a protein, wherein said protein comprises an amino acid sequence selected from the group
consisting of:

(a) the amino acid sequence of SEQ ID NO:21;

(b) the amino acid sequence of SEQ ID NO:21 from amino acid 133 to
amino acid 270;

(c) fragments of the amino acid sequence of SEQ ID NO:21; and

(d) the amino acid sequence encoded by the cDNA insert of clone
CI490_2 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins. Preferably such
protein comprises the amino acid sequence of SEQ ID NO:21 or the amino acid sequence
of SEQ ID NO:21 from amino acid 133 to amino acid 270.

In one embodiment, the present invention provides a composition comprising an
isolated polynucleotide selected from the group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:22;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:22 from nucleotide 268 to nucleotide 624;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:22 from nucleotide 325 to nucleotide 624;

(d) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone CI522_1 deposited under accession
number ATCC 98232;

(e) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone CI522_1 deposited under accession number ATCC 98232;

(f) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone CI522_1 deposited under accession number
ATCC 98232;

(g) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone CI522_1 deposited under accession number ATCC 98232;



WO 98/17687 PCT/US97/19590

(h) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:23;

(i) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:23 having biological activity;

(j) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(g) above;

(k) a polynucleotide which encodes a species homologue of the protein
of (h) or (i) above; and

(l) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(i).

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID
NO:22 from nucleotide 268 to nucleotide 624; the nucleotide sequence of SEQ ID NO:22
from nucleotide 325 to nucleotide 624; the nucleotide sequence of the full-length protein
coding sequence of clone CI522_1 deposited under accession number ATCC 98232; or the
nucleotide sequence of the mature protein coding sequence of clone CI522_1 deposited
under accession number ATCC 98232. In other preferred embodiments, the
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of
clone CI522_1 deposited under accession number ATCC 98232.

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ
ID NO:22 or SEQ ID NO:24.

In other embodiments, the present invention provides a composition comprising
a protein, wherein said protein comprises an amino acid sequence selected from the group
consisting of:

(a) the amino acid sequence of SEQ ID NO:23;

(b) fragments of the amino acid sequence of SEQ ID NO:23; and

(c) the amino acid sequence encoded by the cDNA insert of clone
CI522_1 deposited under accession number ATCC 98232;
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:23. 
30 In one embodiment, the present invention provides a composition comprising an 

isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nudeotide sequence of SEQ ID 

NO:25; 
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(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:25 from nucleotide 288 to nucleotide 713; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:25 from nucleotide 686 to nucleotide 968; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CN238_1 deposited under accession 
number ATCC 98232; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CN238_1 deposited under accession number ATCC 98232; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CN238_1 deposited under accession number 
ATCC 98232; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CN238_1 deposited under accession number ATCC 98232; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:26; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:26 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above; and 

(l) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:25 from nucleotide 288 to nucleotide 713; the nucleotide sequence of SEQ ID NO:25 
from nucleotide 686 to nucleotide 968; the nucleotide sequence of the full-length protein 
coding sequence of clone CN238_1 deposited under accession number ATCC 98232; or the 
nucleotide sequence of the mature protein coding sequence of clone CN238_1 deposited 
under accession number ATCC 98232. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone CN238_1 deposited under accession number ATCC 98232. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:25. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:26; 

(b) fragments of the amino acid sequence of SEQ ID NO:26; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
CN238_1 deposited under accession number ATCC 98232; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:26. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:27; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:27 from nucleotide 87 to nucleotide 1874; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:27 from nucleotide 452 to nucleotide 830; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CO390_1 deposited under accession 
number ATCC 98232; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CO390_1 deposited under accession number ATCC 98232; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CO390_1 deposited under accession number 
ATCC 98232; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CO390_1 deposited under accession number ATCC 98232; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:28; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:28 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above; and 

(l) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:27 from nucleotide 87 to nucleotide 1874; the nucleotide sequence of SEQ ID NO:27 
from nucleotide 452 to nucleotide 830; the nucleotide sequence of the full-length protein 
coding sequence of clone CO390_1 deposited under accession number ATCC 98232; or the 
nucleotide sequence of the mature protein coding sequence of clone CO390_1 deposited 
under accession number ATCC 98232. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone CO390_1 deposited under accession number ATCC 98232. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:28 from amino acid 140 to amino acid 
248. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:27. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:28; 

(b) the amino acid sequence of SEQ ID NO:28 from amino acid 140 to 
amino acid 248; 

(c) fragments of the amino acid sequence of SEQ ID NO:28; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CO390_1 deposited under accession number ATCC 98232; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:28 or the amino acid sequence 
of SEQ ID NO:28 from amino acid 140 to amino acid 248. 

In certain preferred embodiments, the polynucleotide is operably linked to an 
expression control sequence. The invention also provides a host cell, including bacterial, 
yeast, insect and mammalian cells, transformed with such polynucleotide compositions. 

Processes are also provided for producing a protein, which comprise: 

(a) growing a culture of the host cell transformed with such 
polynucleotide compositions in a suitable culture medium; and 

(b) purifying the protein from the culture. 

The protein produced according to such methods is also provided by the present 
invention. Preferred embodiments include those in which the protein produced by such 
process is a mature form of the protein. 

Protein compositions of the present invention may further comprise a 
pharmaceutically acceptable carrier. Compositions comprising an antibody which 
specifically reacts with such protein are also provided by the present invention. 

Methods are also provided for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition comprising a protein of the present invention and a 
pharmaceutically acceptable carrier. 

BRIEF DESCRIPTION OF DRAWINGS 

Fig. 1 is a schematic representation of the pED6 and pNOTs vectors used for 
deposit of clones disclosed herein. 

DETAILED DESCRIPTION 
ISOLATED PROTEINS AND POLYNUCLEOTIDES 

Nucleotide and amino acid sequences, as presently determined, are reported 
below for each clone and protein disclosed in the present application. The nucleotide 
sequence of each clone can readily be determined by sequencing of the deposited clone 
in accordance with known methods. The predicted amino acid sequence (both full-length 
and mature) can then be determined from such nucleotide sequence. The amino acid 
sequence of the protein encoded by a particular clone can also be determined by 
expression of the clone in a suitable host cell, collecting the protein and determining its 
sequence. For each disclosed protein applicants have identified what they have 
determined to be the reading frame best identifiable with sequence information available 
at the time of filing. 
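
As an illustration of how a predicted amino acid sequence follows from a nucleotide sequence once a reading frame is chosen, the sketch below translates a cDNA string with the standard genetic code. It is a minimal example and not the applicants' procedure: the function name `translate`, the 1-based `start` parameter and the toy input sequence are all assumptions introduced here.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cdna, start=1):
    """Translate from 1-based nucleotide `start` to the first stop codon or the end.

    Codons containing characters other than A, C, G, T are reported as 'X'.
    """
    seq = cdna.upper().replace("U", "T")
    protein = []
    for pos in range(start - 1, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":  # stop codon ends the predicted protein
            break
        protein.append(residue)
    return "".join(protein)


# Toy sequence (hypothetical, not taken from any SEQ ID NO): frame starts at nucleotide 3.
print(translate("GGATGGCTTGGTAAAC", start=3))  # MAW
```

Choosing a different `start` shifts the reading frame, which is why the frame reported for each clone is described as the one best identifiable at the time of filing.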

As used herein a "secreted" protein is one which, when expressed in a suitable host 
cell, is transported across or through a membrane, including transport as a result of signal 
sequences in its amino acid sequence. "Secreted" proteins include without limitation 
proteins secreted wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell 
in which they are expressed. "Secreted" proteins also include without limitation proteins 
which are transported across the membrane of the endoplasmic reticulum. 

Clone "AR415_4" 

A polynucleotide of the present invention has been identified as clone "AR415_4". 
AR415_4 was isolated from a human adult retina cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. AR415_4 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "AR415_4 protein"). 

The nucleotide sequence of AR415_4 as presently determined is reported in SEQ 
ID NO:1. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the AR415_4 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:2. Amino acids 14 to 26 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 27, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
AR415_4 should be approximately 1500 bp. 
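
To make the idea of an expected EcoRI/NotI fragment size concrete, the following sketch performs an in-silico double digest of a linear sequence and lists the fragment lengths. It is only an illustration: the function `digest`, the `SITES` table and the toy sequence are assumptions introduced here, and a real check would use the sequence of the deposited construct.

```python
import re

# Recognition site and top-strand cut offset: EcoRI cuts G^AATTC, NotI cuts GC^GGCCGC.
SITES = {"EcoRI": ("GAATTC", 1), "NotI": ("GCGGCCGC", 2)}


def digest(seq, enzymes=("EcoRI", "NotI")):
    """Return top-strand fragment lengths after cutting a linear sequence."""
    seq = seq.upper()
    cuts = set()
    for name in enzymes:
        site, offset = SITES[name]
        # A lookahead pattern also finds overlapping occurrences of the site.
        for match in re.finditer("(?=%s)" % site, seq):
            cuts.add(match.start() + offset)
    bounds = [0] + sorted(cuts) + [len(seq)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


# Toy construct (hypothetical): a 20-nt "insert" flanked by one EcoRI and one NotI site.
toy = "AAAA" + "GAATTC" + "C" * 20 + "GCGGCCGC" + "TTTT"
print(digest(toy))  # [5, 27, 10]
```

The middle value is the EcoRI/NotI fragment; for a deposited clone it would be compared against the approximate size stated for that clone.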

The nucleotide sequence disclosed herein for AR415_4 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. AR415_4 demonstrated at least some homology with sequences 
identified as AA100799 (zm26d01.sl Stratagene pancreas (#937208) Homo sapiens cDNA 
clone 526753 3'), AA100852 (zm26d01 j1 Stratagene pancreas (#937208) Homo sapiens 
cDNA clone 526753 5' similar to SW:CO02_HUMAN P19075 TUMOR-ASSOCIATED 
ANTIGEN CO-029), AA146605 (zo35d)9.rl Stratagene colon (#937204) Homo sapiens 
cDNA clone 588880 5' similar to SW:CO02_HUMAN P19075 TUMOR-ASSOCIATED 
ANTIGEN CO-029), AA224847 (nc33cl2.sl NCI CGAP Pr2 Homo sapiens cDNA clone 
4079 similar to SW:CO02_HUMAN P19075 TUMOR-ASSOCIATED ANTIGEN CO-029), 
AA225191 (nc21h08.sl NCI CGAP Prl Homo sapiens cDNA clone 2968), AA593864 
(nnl9f08.sl Na_CGAP_Col2 Homo sapiens cDNA clone IMAGE:1084359), D26483 
(Mouse mRNA for PE31/TALLA. 3/ ), M33680 (Human 26-kDa cell surface protein 
TAPA-1 mRNA, complete cds), T14726 (Human CD53 antigen cDNA), and T23814 
(Human gene signature HUMGS05723). The predicted amino acid sequence disclosed 
herein for AR415_4 was searched against the GenPept and GeneSeq amino acid sequence 
databases using the BLASTX search protocol. The predicted AR415_4 protein 
demonstrated at least some identity with sequences identified as D29808 (TALLA-1 
[Homo sapiens]), M35252 (tumor-associated antigen [Homo sapiens]), and R22360 
(CO-029 tumour associated antigen protein). Based upon homology, AR415_4 proteins 
and each homologous protein or peptide may share at least some activity. The TopPredII 
computer program predicts a potential transmembrane domain within the AR415_4 
protein sequence centered around amino acid 100 of SEQ ID NO:2. 
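
For readers unfamiliar with how a candidate transmembrane domain can be located from sequence alone, the sketch below averages residue hydropathy over a sliding window and reports windows above a cut-off. This is not the TopPredII algorithm: it substitutes the Kyte-Doolittle scale with a 19-residue window and a 1.6 threshold, and the function name `hydropathy_peaks` and the toy protein are assumptions introduced here.

```python
# Kyte-Doolittle hydropathy values for the twenty standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_peaks(protein, window=19, threshold=1.6):
    """Return (center, mean hydropathy) for each window at or above `threshold`.

    `center` is the 1-based position of the middle residue of the window.
    Residues missing from the scale contribute 0.0.
    """
    half = window // 2
    hits = []
    for i in range(len(protein) - window + 1):
        segment = protein[i:i + window]
        score = sum(KD.get(residue, 0.0) for residue in segment) / window
        if score >= threshold:
            hits.append((i + half + 1, round(score, 2)))
    return hits


# Toy protein (hypothetical): a 19-leucine stretch spanning residues 21 to 39.
toy = "D" * 20 + "L" * 19 + "K" * 20
best = max(hydropathy_peaks(toy), key=lambda hit: hit[1])
print(best)  # (30, 3.8)
```

The reported center plays the same role as the "centered around amino acid" positions quoted for the clones, although the actual values in this document come from TopPredII rather than from this simplified scan.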



Clone "AS63_29" 

A polynucleotide of the present invention has been identified as clone "AS63_29". 
AS63_29 was isolated from a human fetal brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. AS63_29 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"AS63_29 protein"). 

The nucleotide sequence of the 5' portion of AS63_29 as presently determined is 
reported in SEQ ID NO:3. What applicants presently believe is the proper reading frame 
for the coding region is indicated in SEQ ID NO:4. The predicted amino acid sequence of 
the AS63_29 protein corresponding to the foregoing nucleotide sequence is reported in 
SEQ ID NO:4. Amino acids 28 to 40 are a predicted leader/signal sequence, with the 
predicted mature amino acid sequence beginning at amino acid 41, or are a 
transmembrane domain. Additional nucleotide sequence from the 3' portion of AS63_29, 
including the polyA tail, is reported in SEQ ID NO:5. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
AS63_29 should be approximately 1700 bp. 

The nucleotide sequence disclosed herein for AS63_29 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. AS63_29 demonstrated at least some homology with sequences 
identified as L26877 (Mus musculus (B20c) heavy chain immunoglobulin variable region 
gene), T09146 (EST07039 Homo sapiens cDNA clone HIBBP68 5' end), T23466 (seq3050 
Homo sapiens cDNA clone Hyl8-Chl3-Charon40-cDNA-100 3'), and W55739 (ma35f05.r] 
Life Tech mouse brain Mus musculus cDNA clone 312705 5'). The predicted amino acid 
sequence disclosed herein for AS63_29 was searched against the GenPept and GeneSeq 
amino acid sequence databases using the BLASTX search protocol. The predicted 
AS63_29 protein demonstrated at least some identity with sequences identified as R04032 
(Full length T4 encoded by plasmid pBG381). Based upon homology, AS63_29 proteins 
and each homologous protein or peptide may share at least some activity. The TopPredII 
computer program predicts a potential transmembrane domain within the AS63_29 
protein sequence, near the amino terminus. 



Clone "AY304_14" 

A polynucleotide of the present invention has been identified as clone "AY304_14". 
AY304_14 was isolated from a human adult retina cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. AY304_14 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "AY304_14 protein"). 

The nucleotide sequence of AY304_14 as presently determined is reported in SEQ 
ID NO:6. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the AY304_14 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:7. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
AY304_14 should be approximately 2200 bp. 

The nucleotide sequence disclosed herein for AY304_14 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. AY304_14 demonstrated at least some homology with sequences 
identified as AA127688 (zk92fD5.sl Soares pregnant uterus NbHPU Homo sapiens cDNA 
clone 490305 3'), AA179609 (zp49gll.rl Stratagene HeLa cell s3 937216 Homo sapiens 
cDNA clone 612836 5'), AA276253 (vc40f05.rl Barstead MPLRBl Mus musculus cDNA 
clone 777057 5'), H15545 (ym27d04.sl Homo sapiens cDNA clone 49495 3' similar to 
contains PTR5 repetitive element), L08441 (Human autonomously replicating sequence 
(ARS) mRNA), N34949 (yy49h09.sl Homo sapiens cDNA clone 276929 3'), R48594 
(yj65d07.sl Homo sapiens cDNA clone 153613 3'), T21160 (Human gene signature 
HUMGS02466), U43284 (Cloning vector phGFP-S65T, complete sequence, green 
fluorescent protein (gfp) gene, complete cds), and Z45151 (H. sapiens partial cDNA 
sequence; clone c-2hh04). The predicted amino acid sequence disclosed herein for 
AY304_14 was searched against the GenPept and GeneSeq amino acid sequence databases 
using the BLASTX search protocol. The predicted AY304_14 protein demonstrated at 
least some identity with sequences identified as D86984 (similar to yeast adenylate cyclase 
(S56776) [Homo sapiens]), J01415 (cytochrome oxidase subunit 3 [Homo sapiens]), V00662 
(cytochrome oxidase III [Homo sapiens]), and X68948 (envelope glycoprotein [Spleen 
focus-forming virus]). Based upon homology, AY304_14 proteins and each homologous 
protein or peptide may share at least some activity. The TopPredII computer program 
predicts two potential transmembrane domains within the AY304_14 protein sequence, 
one centered around amino acid 81 and another around amino acid 120 of SEQ ID NO:7. 

Clone "BG160_1" 

A polynucleotide of the present invention has been identified as clone "BG160_1". 
BG160_1 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. BG160_1 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "BG160_1 protein"). 

The nucleotide sequence of BG160_1 as presently determined is reported in SEQ 
ID NO:8. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the BG160_1 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:9. Amino acids 588 to 600 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 601, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
BG160_1 should be approximately 2300 bp. 

The nucleotide sequence disclosed herein for BG160_1 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. BG160_1 demonstrated at least some homology with sequences 
identified as A60021 (tropomyosin-related protein, neuronal - rat; contains element 
MER27 repetitive element), AA081525 (zn20e02.rl Stratagene neuroepithelium NT2RAMI 
937234 Homo sapiens cDNA clone 547994 5'), AA092565 (115773.seq.F Fetal heart, Lambda 
ZAP Express Homo sapiens cDNA 5'), D56138 (Human fetal brain cDNA 5'-end 
GEN-416H11), D61090 (Human fetal brain cDNA 5'-end GEN-155A07), D61184 (Human 
fetal brain cDNA 5'-end GEN-165A01), L10335 (Homo sapiens neuro-endocrine-specific 
protein C (NSP) mRNA, complete cds), N21304 (yx53f07.sl Homo sapiens cDNA clone 
265477 3' similar to SP:A60021 A60021 TROPOMYOSIN-RELATED PROTEIN, 
NEURONAL), and W95814 (ze07fll.rl Soares fetal heart NbHH19W Homo sapiens 
cDNA clone 358317 5' similar to PIR:A60021). The predicted amino acid sequence 
disclosed herein for BG160_1 was searched against the GenPept and GeneSeq amino acid 
sequence databases using the BLASTX search protocol. The predicted BG160_1 protein 
demonstrated at least some identity with sequences identified as L10334 
(neuroendocrine-specific protein B [Homo sapiens]) and L10335 (neuroendocrine-specific 
protein C [Homo sapiens]). Based upon homology, BG160_1 proteins and each 
homologous protein or peptide may share at least some activity. The TopPredII computer 
program predicts three potential transmembrane domains within the BG160_1 protein 
sequence, centered around amino acids 84, 484, and 595 of SEQ ID NO:9. 






Clone "B0432_4" 

A polynucleotide of the present invention has been identified as clone "B0432_4". 
B0432_4 was isolated from a human adult retina cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. B0432_4 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "B0432_4 protein"). 

The nucleotide sequence of the 5' portion of B0432_4 as presently determined is 
reported in SEQ ID NO:10. An additional internal nucleotide sequence from B0432_4 as 
presently determined is reported in SEQ ID NO:11. What applicants believe is the proper 
reading frame and the predicted amino acid sequence encoded by such internal sequence 
is reported in SEQ ID NO:12. Additional nucleotide sequence from the 3' portion of 
B0432_4, including the polyA tail, is reported in SEQ ID NO:13. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
B0432_4 should be approximately 1700 bp. 

The nucleotide sequence disclosed herein for B0432_4 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. B0432_4 demonstrated at least some homology with sequences 
identified as AA283626 (ztl5e09.sl Soares NbHTGBC Homo sapiens cDNA clone 713224 
3'), AA406486 (zvl2g02.rl Soares NhHMPu SI Homo sapiens cDNA clone 753458 5' 
similar to WP F35G2.2 CE05809 E.COLI YCAC LIKE), AA570446 (nk62cl2.sl 
Na_CGAP_Schl Homo sapiens cDNA clone IMAGE:1018102), N55855 (03389? Homo 
sapiens cDNA clone J3389 5'), Q10613 (Ryanodine receptor gene), T62691 (ycZOdlO.rl Homo 
sapiens cDNA clone 86035 5'), and W90766 (zh79h04.sl Soares fetal liver spleen INFLS 
SI Homo sapiens cDNA clone 418327 3'). The predicted amino acid sequence disclosed 
herein for B0432_4 was searched against the GenPept and GeneSeq amino acid sequence 
databases using the BLASTX search protocol. The predicted B0432_4 protein 
demonstrated at least some identity with sequences identified as Z69637 (F35G2.2 
[Caenorhabditis elegans]). Based upon homology, B0432_4 proteins and each 
homologous protein or peptide may share at least some activity. The TopPredII computer 
program predicts a potential transmembrane domain at the amino terminus of the 
B0432_4 protein sequence. The B0432_4 protein may also contain the bacterial lysR 
family signature, a motif found in bacterial transcriptional regulators and which is 
possibly indicative of a helix-turn-helix structure. 

Clone "B0538_2" 

A polynucleotide of the present invention has been identified as clone "B0538_2". 
B0538_2 was isolated from a human adult retina cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. B0538_2 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "B0538_2 protein"). 

The nucleotide sequence of the 5' portion of B0538_2 as presently determined is 
reported in SEQ ID NO:14. What applicants presently believe is the proper reading frame 
for the coding region is indicated in SEQ ID NO:15. The predicted amino acid sequence 
of the B0538_2 protein corresponding to the foregoing nucleotide sequence is reported 
in SEQ ID NO:15. Additional nucleotide sequence from the 3' portion of B0538_2, 
including the polyA tail, is reported in SEQ ID NO:16. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
B0538_2 should be approximately 3000 bp. 

The nucleotide sequence disclosed herein for B0538_2 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. B0538_2 demonstrated at least some homology with sequences 
identified as AA503100 (ne44h01.sl NCI_CGAP_Co3 Homo sapiens cDNA clone 900241), 
R44035 (yg21g09.sl Homo sapiens cDNA clone 33167 3'), T21630 (Human gene signature 
HUMGS03066), and W64854 (me06dl2.rl Soares mouse embryo NbME13.5 14.5 Mus 
musculus cDNA clone 386711 5' similar to PIR S40989 S40989 hypothetical protein 
F55H2.6 - Caenorhabditis elegans). The predicted amino acid sequence disclosed herein 
for B0538_2 was searched against the GenPept and GeneSeq amino acid sequence 
databases using the BLASTX search protocol. The predicted B0538_2 protein 
demonstrated at least some identity with sequences identified as M60525 (nerve growth 
factor inducible protein [Rattus norvegicus]), R28916 (Type IE procollagen), and Z27080 
(F55H2.6 [Caenorhabditis elegans]). Based upon homology, B0538_2 proteins and each 
homologous protein or peptide may share at least some activity. The TopPredII computer 
program predicts two potential transmembrane domains within the B0538_2 protein 
sequence. 

Clone "BR595_4" 

A polynucleotide of the present invention has been identified as clone "BR595_4". 
BR595_4 was isolated from a human fetal kidney cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. BR595_4 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"BR595_4 protein"). 

The nucleotide sequence of the 5' portion of BR595_4 as presently determined is 
reported in SEQ ID NO:17. What applicants presently believe is the proper reading frame 
for the coding region is indicated in SEQ ID NO:18. The predicted amino acid sequence 
of the BR595_4 protein corresponding to the foregoing nucleotide sequence is reported in 
SEQ ID NO:18. Additional nucleotide sequence from the 3' portion of BR595_4, 
including the polyA tail, is reported in SEQ ID NO:19. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
BR595_4 should be approximately 3000 bp. 

The nucleotide sequence disclosed herein for BR595_4 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. BR595_4 demonstrated at least some homology with sequences 
identified as AA443742 (zw95b02.sl Soares total fetus Nb2HF8 9w Homo sapiens cDNA 
clone 784683 3'), AA600820 (np45b08.sl NCI_CGAP_Brl.l Homo sapiens cDNA clone 
IMAGE:1129239), T19410 (Human gene signature HUMGS00435), W87465 (zh67c04.sl 
Soares fetal liver spleen INFLS SI Homo sapiens cDNA clone 417126 3'), and Z33587 (H. 
sapiens partial cDNA sequence; clone HEA89P; single read). Based upon homology, 
BR595_4 proteins and each homologous protein or peptide may share at least some 
activity. 

Clone "CI490_2" 

A polynucleotide of the present invention has been identified as clone "CI490_2". 
CI490_2 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. CI490_2 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"CI490_2 protein"). 

The nucleotide sequence of CI490_2 as presently determined is reported in SEQ 
ID NO:20. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the CI490_2 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:21. Amino acids 64 to 76 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 77, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
CI490_2 should be approximately 1200 bp. 

The nucleotide sequence disclosed herein for CI490_2 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. CI490_2 demonstrated at least some homology with sequences 
identified as H30751 (yo79a04.rl Homo sapiens cDNA clone 184110 5'), H49766 
(yo24f01.rl Homo sapiens cDNA clone 178873 5' similar to SP:S19586 N-METHYL- 
D-ASPARTATE RECEPTOR GLUTAMATE-BINDING CHAIN), H51158 (yo32d04.rl 
Homo sapiens cDNA clone 179623 5'), R85211 (yo41dll.sl Homo sapiens cDNA clone 
180501 3' similar to SP:S19586 N-METHYL-D-ASPARTATE RECEPTOR GLUTAMATE- 
BINDING CHAIN), S19586 (N-METHYL-D-ASPARTATE RECEPTOR 
GLUTAMATE-BINDING CHAIN), S61973 (NMDA receptor glutamate-binding subunit 
[rats, mRNA, 1742 nt]), T01031 (Human leucine zipper protein-kinase cDNA sequence), 
and W56893 (zc01g05.rl Soares parathyroid tumor NbHPA Homo sapiens cDNA clone 
321080 5' similar to PIR:S19586 S19586 N-methyl-D-aspartate receptor glutamate-binding 
chain - rat). The predicted amino acid sequence disclosed herein for CI490_2 was 
searched against the GenPept and GeneSeq amino acid sequence databases using the 
BLASTX search protocol. The predicted CI490_2 protein demonstrated at least some 
identity with sequences identified as S61973 (NMDA receptor glutamate-binding subunit 
[rats; Peptide, 516 aa] [Rattus sp.]) and U08020 (collagen pro-alpha-1 type I chain [Mus 
musculus]). Based upon homology, CI490_2 proteins and each homologous protein or 
peptide may share at least some activity. The TopPredII computer program predicts six 
potential transmembrane domains within the CI490_2 protein sequence, with the most 
amino-terminal transmembrane domain centered around amino acid 77 of SEQ ID NO:21. 

Clone "CI522_1" 

A polynucleotide of the present invention has been identified as clone "CI522_1". 
CI522_1 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. CI522_1 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"CI522_1 protein"). 

The nucleotide sequence of the 5' portion of CI522_1 as presently determined is 
reported in SEQ ID NO:22. What applicants presently believe is the proper reading frame 
for the coding region is indicated in SEQ ID NO:23. The predicted amino acid sequence 
of the CI522_1 protein corresponding to the foregoing nucleotide sequence is reported in 
SEQ ID NO:23. Amino acids 7 to 19 are a predicted leader/signal sequence, with the 
predicted mature amino acid sequence beginning at amino acid 20, or are a 
transmembrane domain. Additional nucleotide sequence from the 3' portion of CI522_1, 
including the polyA tail, is reported in SEQ ID NO:24. 
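
The amino acid positions given here can be related to the nucleotide ranges recited earlier for SEQ ID NO:22 (nucleotides 268 to 624 and 325 to 624) by simple codon arithmetic. The sketch below is illustrative only; the helper names `codon_start` and `codon_count` are assumptions introduced here, and the source does not state whether the final codon of each range is a stop codon, so the counts are of codons rather than residues.

```python
def codon_start(orf_start, residue):
    """1-based nucleotide position of the first base of the codon for `residue`.

    `orf_start` is the 1-based position of the first base of codon 1.
    """
    if residue < 1:
        raise ValueError("residue numbers are 1-based")
    return orf_start + 3 * (residue - 1)


def codon_count(first_nt, last_nt):
    """Number of whole codons in the inclusive nucleotide range."""
    span = last_nt - first_nt + 1
    if span <= 0 or span % 3:
        raise ValueError("range is not a positive whole number of codons")
    return span // 3


# With the reading frame opening at nucleotide 268, amino acid 20 begins at:
print(codon_start(268, 20))   # 325
print(codon_count(268, 624))  # 119 codons in the full-length range
print(codon_count(325, 624))  # 100 codons in the mature range
```

The result agrees with the recited ranges: a mature sequence beginning at amino acid 20 corresponds to a coding range beginning at nucleotide 325.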

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
CI522_1 should be approximately 1400 bp. 

The nucleotide sequence disclosed herein for CI522_1 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. CI522_1 demonstrated at least some homology with sequences 
identified as AA028557 (mil8g05.rl Soares mouse p3NMF19.5 Mus musculus cDNA clone 
463928 5'), H32238 (EST107136 Rattus sp. cDNA 5' end), T33525 (EST58140 Homo sapiens 
cDNA 5' end similar to None), U66468 (Human cell growth regulator CGR11 mRNA, 
complete cds), and X00525 (Mouse 28S ribosomal RNA). The predicted amino acid 
sequence disclosed herein for CI522_1 was searched against the GenPept and GeneSeq 
amino acid sequence databases using the BLASTX search protocol. The predicted CI522_1 
protein demonstrated at least some identity with sequences identified as U66468 (cell 
growth regulator CGR11 [Homo sapiens]). Based upon homology, CI522_1 proteins and 
each homologous protein or peptide may share at least some activity. 

Clone "CN238_1" 

A polynucleotide of the present invention has been identified as clone "CN238_1". 
CN238_1 was isolated from a human fetal brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. CN238_1 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "CN238_1 protein"). 

The nucleotide sequence of CN238_1 as presently determined is reported in SEQ 
ID NO:25. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the CN238_1 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:26. 

The nucleotide sequence disclosed herein for CN238_1 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. CN238_1 demonstrated at least some homology with sequences 
identified as AA044097 (zk51b02.r1 Soares pregnant uterus NbHPU Homo sapiens cDNA 
clone 486315 5'), AA044287 (zk51b02.s1 Soares pregnant uterus NbHPU Homo sapiens 
cDNA clone 486315 3'), AA045440 (zk67c03.s1 Soares pregnant uterus NbHPU Homo 
sapiens cDNA clone 487876 3'), AA143007 (zl48f01.r1 Soares pregnant uterus NbHPU 
Homo sapiens cDNA clone 505177 5'), D51196 (Human fetal brain cDNA 3'-end 
GEN-016G05), D60310 (Human fetal brain cDNA 3'-end GEN-098A09), N69344 
(yz43e04.s1 Homo sapiens cDNA clone 285822 3' similar to gb:K00558 TUBULIN 
ALPHA-1 CHAIN (HUMAN)), W22250 (64B8 Human retina cDNA Tsp509I-cleaved 
sublibrary Homo), and X01703 (Human gene for alpha-tubulin (b alpha 1)). The predicted 
amino acid sequence disclosed herein for CN238_1 was searched against the GenPept and 
GeneSeq amino acid sequence databases using the BLASTX search protocol. The 
predicted CN238_1 protein demonstrated at least some identity with sequences identified 
as K00557 (alpha-tubulin [Homo sapiens]) and U51583 (zinc finger homeodomain 
enhancer-binding protein-1 [Rattus norvegicus]). Based upon homology, CN238_1 
proteins and each homologous protein or peptide may share at least some activity. 

Clone "CO390_1" 

A polynucleotide of the present invention has been identified as clone "CO390_1". 
CO390_1 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. CO390_1 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "CO390_1 protein"). 

The nucleotide sequence of CO390_1 as presently determined is reported in SEQ 
ID NO:27. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the CO390_1 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:28. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
CO390_1 should be approximately 2300 bp. 

The nucleotide sequence disclosed herein for CO390_1 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. CO390_1 demonstrated at least some homology with sequences 
identified as H84353 (yvSSall.rl Homo sapiens cDNA clone 249500 5'), L35532 (Pan 
troglodytes Alu repeat region), N80616 (Genomic clone encoding SAP(Phe)), R53922 
(yi03h10.s1 Homo sapiens cDNA clone 138211 3' similar to contains Alu repetitive 
element;contains TAR1 repetitive element), X75335 (H.sapiens Alu insertion in COL3A1 
gene), X95882 (R.norvegicus mRNA for ATP ligand gated ion channel), and Y09561 
(H.sapiens mRNA for P2X7 receptor). The predicted amino acid sequence disclosed 
herein for CO390_1 was searched against the GenPept and GeneSeq amino acid sequence 
databases using the BLASTX search protocol. The predicted CO390_1 protein 
demonstrated at least some identity with sequences identified as U45448 (P2x1 receptor 
[Homo sapiens]), W04216 (Rat superior cervical ganglion p2x receptor), X83688 (ATP 
receptor [Homo sapiens]), X95882 (P2X7 gene product [Rattus norvegicus]), and Y09561 
(ATP receptor [Homo sapiens]). Based upon homology, CO390_1 proteins and each 
homologous protein or peptide may share at least some activity. The TopPredII computer 
program predicts a potential transmembrane domain within the CO390_1 protein 
sequence, centered around amino acid 249 of SEQ ID NO:28. The nucleotide sequence of 
CO390_1 may contain an Alu repetitive element. 

Deposit of Clones 

Clones AR415_4, AS63_29, BG160_1, B0432_4, B0538_2, BR595_4, CI490_2, 
CI522_1, CN238_1, CO390_1, and AY304_1 (an additional isolate of clone AY304_14) were 
deposited on October 25, 1996 with the American Type Culture Collection as an original 
deposit under the Budapest Treaty and were given the accession number ATCC 98232, 
from which each clone comprising a particular polynucleotide is obtainable. Clone 
AY304_14 was deposited on October 23, 1997 with the American Type Culture Collection 
as an original deposit under the Budapest Treaty and was given the accession number 
ATCC xxxxx. All restrictions on the availability to the public of the deposited material 
will be irrevocably removed upon the granting of the patent, except for the requirements 
specified in 37 C.F.R. § 1.808(b). 

Each clone has been transfected into separate bacterial cells (E. coli) in this 
composite deposit. Each clone can be removed from the vector in which it was deposited 
by performing an EcoRI/NotI digestion (5' site, EcoRI; 3' site, NotI) to produce the 
appropriate fragment for such clone. Each clone was deposited in either the pED6 or 
pNOTs vector depicted in Fig. 1. The pED6dpc2 vector ("pED6") was derived from 
pED6dpc1 by insertion of a new polylinker to facilitate cDNA cloning (Kaufman et al., 
1991, Nucleic Acids Res. 19: 4485-4490); the pNOTs vector was derived from pMT2 
(Kaufman et al., 1989, Mol. Cell. Biol. 9: 946-958) by deletion of the DHFR sequences, 
insertion of a new polylinker, and insertion of the M13 origin of replication in the ClaI site. 
In some instances, the deposited clone can become "flipped" (i.e., in the reverse 
orientation) in the deposited isolate. In such instances, the cDNA insert can still be 
isolated by digestion with EcoRI and NotI. However, NotI will then produce the 5' site 
and EcoRI will produce the 3' site for placement of the cDNA in proper orientation for 
expression in a suitable vector. The cDNA may also be expressed from the vectors in 
which they were deposited. 

Bacterial cells containing a particular clone can be obtained from the composite 
deposit as follows: 

An oligonucleotide probe or probes should be designed to the sequence that is 
known for that particular clone. This sequence can be derived from the sequences 
provided herein, or from a combination of those sequences. The sequence of the 
oligonucleotide probe that was used to isolate each full-length clone is identified below, 
and should be most reliable in isolating the clone of interest. 



Clone          Probe Sequence 

AR415_4        SEQ ID NO:29 
AS63_29        SEQ ID NO:30 
AY304_14       SEQ ID NO:31 
BG160_1        SEQ ID NO:32 
B0432_4        SEQ ID NO:33 
B0538_2        SEQ ID NO:34 
BR595_4        SEQ ID NO:35 
CI490_2        SEQ ID NO:36 
CI522_1        SEQ ID NO:37 
CN238_1        SEQ ID NO:38 
CO390_1        SEQ ID NO:39 



In the sequences listed above which include an N at position 2, that position is occupied 
in preferred probes/primers by a biotinylated phosphoramidite residue rather than a 
nucleotide (such as, for example, that produced by use of biotin phosphoramidite 
(1-dimethoxytrityloxy-2-(N-biotinyl-4-aminobutyl)-propyl-3-O-(2-cyanoethyl)-(N,N- 
diisopropyl)-phosphoramidite) (Glen Research, cat. no. 10-1953)). 

The design of the oligonucleotide probe should preferably follow these 
parameters: 

(a) It should be designed to an area of the sequence which has the fewest 
ambiguous bases ("N's"), if any; 
(b) It should be designed to have a Tm of approx. 80°C (assuming 2°C for each 
A or T and 4°C for each G or C). 
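
The two design parameters above can be illustrated with a short sketch. It applies the counting rule stated in parameter (b) (2 degrees per A or T, 4 degrees per G or C) and ranks candidate windows first by number of ambiguous bases and then by closeness to the target Tm. The function names, the window-scanning strategy and the default target of 80 are illustrative assumptions, not part of the disclosed procedure.

```python
def wallace_tm(probe: str) -> int:
    """Estimated Tm in degrees C: 2 per A or T, 4 per G or C (other symbols ignored)."""
    seq = probe.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def ambiguous_count(probe: str) -> int:
    """Number of positions that are not A, C, G or T (for example 'N')."""
    return sum(1 for base in probe.upper() if base not in "ACGT")


def best_window(sequence: str, length: int, target_tm: int = 80):
    """Return (start, window) with the fewest ambiguous bases; ties go to the Tm nearest target_tm."""
    if length <= 0 or length > len(sequence):
        raise ValueError("window length must be between 1 and len(sequence)")
    candidates = [
        (start, sequence[start:start + length])
        for start in range(len(sequence) - length + 1)
    ]
    # Hypothetical ranking: ambiguity first (parameter a), then Tm distance (parameter b).
    return min(
        candidates,
        key=lambda c: (ambiguous_count(c[1]), abs(wallace_tm(c[1]) - target_tm)),
    )
```

For instance, `wallace_tm("ATGC")` counts two A/T bases and two G/C bases and returns 12.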

The oligonucleotide should preferably be labeled with γ-32P ATP (specific activity 6000 
Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for 
labeling oligonucleotides. Other labeling techniques can also be used. Unincorporated 
label should preferably be removed by gel filtration chromatography or other established 
methods. The amount of radioactivity incorporated into the probe should be quantitated 
by measurement in a scintillation counter. Preferably, specific activity of the resulting 
probe should be approximately 4e+6 dpm/pmole. 

The bacterial culture containing the pool of full-length clones should preferably 
be thawed and 100 µl of the stock used to inoculate a sterile culture flask containing 25 ml 
of sterile L-broth containing ampicillin at 100 µg/ml. The culture should preferably be 
grown to saturation at 37°C, and the saturated culture should preferably be diluted in 
fresh L-broth. Aliquots of these dilutions should preferably be plated to determine the 
dilution and volume which will yield approximately 5000 distinct and well-separated 
colonies on solid bacteriological media containing L-broth containing ampicillin at 100 
µg/ml and agar at 1.5% in a 150 mm petri dish when grown overnight at 37°C. Other 
known methods of obtaining distinct, well-separated colonies can also be employed. 

Standard colony hybridization procedures should then be used to transfer the 
colonies to nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65°C for 1 hour with gentle agitation in 
6X SSC (20X stock is 175.3 g NaCl/liter, 88.2 g Na citrate/liter, adjusted to pH 7.0 with 
NaOH) containing 0.5% SDS, 100 µg/ml of yeast RNA, and 10 mM EDTA (approximately 
10 mL per 150 mm filter). Preferably, the probe is then added to the hybridization mix at 
a concentration greater than or equal to 1e+6 dpm/mL. The filter is then preferably 
incubated at 65°C with gentle agitation overnight. The filter is then preferably washed in 
500 mL of 2X SSC/0.5% SDS at room temperature without agitation, preferably followed 
by 500 mL of 2X SSC/0.1% SDS at room temperature with gentle shaking for 15 minutes. 
A third wash with 0.1X SSC/0.5% SDS at 65°C for 30 minutes to 1 hour is optional. The 
filter is then preferably dried and subjected to autoradiography for sufficient time to 
visualize the positives on the X-ray film. Other known hybridization methods can also 
be employed. 

The positive colonies are picked, grown in culture, and plasmid DNA isolated 
using standard procedures. The clones can then be verified by restriction analysis, 
hybridization analysis, or DNA sequencing. 
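
As a worked illustration of the quantities in the hybridization step, the arithmetic can be sketched as follows, using the figures stated above (about 10 mL of mix per filter, at least 1e+6 dpm/mL of probe, a probe specific activity of about 4e+6 dpm/pmole, and 6X SSC prepared from a 20X stock). The function names and default arguments are illustrative assumptions only.

```python
def probe_pmol_needed(volume_ml: float = 10.0,
                      target_dpm_per_ml: float = 1e6,
                      specific_activity_dpm_per_pmol: float = 4e6) -> float:
    """Minimum pmol of labeled probe needed to reach the target dpm/mL in the given volume."""
    total_dpm = volume_ml * target_dpm_per_ml
    return total_dpm / specific_activity_dpm_per_pmol


def ssc_stock_volume_ml(final_volume_ml: float,
                        final_fold: float,
                        stock_fold: float = 20.0) -> float:
    """Volume of concentrated SSC stock to dilute to final_fold in final_volume_ml."""
    if final_fold > stock_fold:
        raise ValueError("final concentration cannot exceed the stock concentration")
    return final_volume_ml * final_fold / stock_fold
```

Under these assumptions, one 150 mm filter in 10 mL of mix needs at least 2.5 pmole of probe, and 10 mL of 6X SSC takes 3 mL of the 20X stock.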

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the 
protein may be in linear form or they may be cyclized using known methods, for example, 
as described in H.U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R.S. 
McDowell, et al., J. Amer. Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated 
herein by reference. Such fragments may be fused to carrier molecules such as 
immunoglobulins for many purposes, including increasing the valency of protein binding 
sites. For example, fragments of the protein may be fused through "linker" sequences to 
the Fc portion of an immunoglobulin. For a bivalent form of the protein, such a fusion 
could be to the Fc portion of an IgG molecule. Other immunoglobulin isotypes may also 
be used to generate such fusions. For example, a protein - IgM fusion would generate a 
decavalent form of the protein of the invention. 

The present invention also provides both full-length and mature forms of the 
disclosed proteins. The full-length form of such proteins is identified in the sequence 
listing by translation of the nucleotide sequence of each disclosed clone. The mature form 
of such protein may be obtained by expression of the disclosed full-length polynucleotide 
(preferably those deposited with ATCC) in a suitable mammalian cell or other host cell. 
The sequence of the mature form of the protein may also be determinable from the amino 
acid sequence of the full-length form. 

The present invention also provides genes corresponding to the cDNA sequences 
disclosed herein. "Corresponding genes" are the regions of the genome that are 
transcribed to produce the mRNAs from which the cDNA sequences are derived and any 
contiguous regions of the genome necessary for the regulated expression of such genes, 
including but not limited to coding sequences, 5' and 3' untranslated regions, alternatively 
spliced exons, introns, promoters, enhancers, and silencer or suppressor elements. The 
corresponding genes can be isolated in accordance with known methods using the 
sequence information disclosed herein. Such methods include the preparation of probes 
or primers from the disclosed sequence information for identification and/or 
amplification of genes in appropriate genomic libraries or other sources of genomic 
materials. 

Where the protein of the present invention is membrane-bound (e.g., is a receptor), 
the present invention also provides for soluble forms of such protein. In such forms part 
or all of the intracellular and transmembrane domains of the protein are deleted such that 
the protein is fully secreted from the cell in which it is expressed. The intracellular and 
transmembrane domains of proteins of the invention can be identified in accordance with 
known techniques for determination of such domains from sequence information. 

Proteins and protein fragments of the present invention include proteins with 
amino acid sequence lengths that are at least 25% (more preferably at least 50%, and most 
preferably at least 75%) of the length of a disclosed protein and have at least 60% sequence 
identity (more preferably, at least 75% identity; most preferably at least 90% or 95% 
identity) with that disclosed protein, where sequence identity is determined by comparing 
the amino acid sequences of the proteins when aligned so as to maximize overlap and 
identity while minimizing sequence gaps. Also included in the present invention are 
proteins and protein fragments that contain a segment preferably comprising 8 or more 
(more preferably 20 or more, most preferably 30 or more) contiguous amino acids that 
shares at least 75% sequence identity (more preferably, at least 85% identity; most 
preferably at least 95% identity) with any such segment of any of the disclosed proteins. 
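
The length and identity thresholds in the preceding paragraph can be checked on a pair of sequences that have already been aligned. The sketch below is illustrative only: it assumes the alignment is supplied as two equal-length strings with "-" marking gaps, and it assumes percent identity is taken over columns in which both sequences have a residue. The text does not fix a denominator, so that choice, like the function names, is a hypothetical convention.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of identical residues over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


def meets_criteria(aligned_candidate: str,
                   aligned_disclosed: str,
                   min_length_fraction: float = 0.25,
                   min_identity: float = 60.0) -> bool:
    """True if the candidate meets both the length-fraction and the identity threshold."""
    candidate_len = len(aligned_candidate.replace("-", ""))
    disclosed_len = len(aligned_disclosed.replace("-", ""))
    long_enough = candidate_len >= min_length_fraction * disclosed_len
    similar_enough = percent_identity(aligned_candidate, aligned_disclosed) >= min_identity
    return long_enough and similar_enough
```

The defaults correspond to the broadest thresholds recited (25% of the length, 60% identity); the more preferred thresholds can be passed in as arguments.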

Species homologs of the disclosed polynucleotides and proteins are also provided 
by the present invention. Species homologs may be isolated and identified by making 
suitable probes or primers from the sequences provided herein and screening a suitable 
nucleic acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides 
or proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide 
which also encode proteins which are identical, homologous or related to that encoded 
by the polynucleotides. 

The invention also includes polynucleotides with sequences complementary to 
those of the polynucleotides disclosed herein. 

The present invention also includes polynucleotides capable of hybridizing 
under reduced stringency conditions, more preferably stringent conditions, and most 
preferably highly stringent conditions, to polynucleotides described herein. Examples of 
stringency conditions are shown in the table below: highly stringent conditions are those 
that are at least as stringent as, for example, conditions A-F; stringent conditions are at 
least as stringent as, for example, conditions G-L; and reduced stringency conditions are 
at least as stringent as, for example, conditions M-R. 

Stringency  Polynucleotide  Hybrid Length  Hybridization Temperature                      Wash Temperature 
Condition   Hybrid          (bp)‡          and Buffer†                                    and Buffer† 

A           DNA:DNA         ≥50            65°C; 1xSSC -or- 42°C; 1xSSC, 50% formamide    65°C; 0.3xSSC 
B           DNA:DNA         <50            TB*; 1xSSC                                     TB*; 1xSSC 
C           DNA:RNA         ≥50            67°C; 1xSSC -or- 45°C; 1xSSC, 50% formamide    67°C; 0.3xSSC 
D           DNA:RNA         <50            TD*; 1xSSC                                     TD*; 1xSSC 
E           RNA:RNA         ≥50            70°C; 1xSSC -or- 50°C; 1xSSC, 50% formamide    70°C; 0.3xSSC 
F           RNA:RNA         <50            TF*; 1xSSC                                     TF*; 1xSSC 
G           DNA:DNA         ≥50            65°C; 4xSSC -or- 42°C; 4xSSC, 50% formamide    65°C; 1xSSC 
H           DNA:DNA         <50            TH*; 4xSSC                                     TH*; 4xSSC 
I           DNA:RNA         ≥50            67°C; 4xSSC -or- 45°C; 4xSSC, 50% formamide    67°C; 1xSSC 
J           DNA:RNA         <50            TJ*; 4xSSC                                     TJ*; 4xSSC 
K           RNA:RNA         ≥50            70°C; 4xSSC -or- 50°C; 4xSSC, 50% formamide    67°C; 1xSSC 
L           RNA:RNA         <50            TL*; 2xSSC                                     TL*; 2xSSC 
M           DNA:DNA         ≥50            50°C; 4xSSC -or- 40°C; 6xSSC, 50% formamide    50°C; 2xSSC 
N           DNA:DNA         <50            TN*; 6xSSC                                     TN*; 6xSSC 
O           DNA:RNA         ≥50            55°C; 4xSSC -or- 42°C; 6xSSC, 50% formamide    55°C; 2xSSC 
P           DNA:RNA         <50            TP*; 6xSSC                                     TP*; 6xSSC 
Q           RNA:RNA         ≥50            60°C; 4xSSC -or- 45°C; 6xSSC, 50% formamide    60°C; 2xSSC 
R           RNA:RNA         <50            TR*; 4xSSC                                     TR*; 4xSSC 

‡: The hybrid length is that anticipated for the hybridized region(s) of the hybridizing polynucleotides. When 
hybridizing a polynucleotide to a target polynucleotide of unknown sequence, the hybrid length is assumed 
to be that of the hybridizing polynucleotide. When polynucleotides of known sequence are hybridized, the 
hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region 
or regions of optimal sequence complementarity. 

†: SSPE (1xSSPE is 0.15M NaCl, 10mM NaH2PO4, and 1.25mM EDTA, pH 7.4) can be substituted for SSC 
(1xSSC is 0.15M NaCl and 15mM sodium citrate) in the hybridization and wash buffers; washes are 
performed for 15 minutes after hybridization is complete. 

*TB - TR: The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should 
be 5-10°C less than the melting temperature (Tm) of the hybrid, where Tm is determined according to the 
following equations. For hybrids less than 18 base pairs in length, Tm(°C) = 2(# of A + T bases) + 4(# of G + 
C bases). For hybrids between 18 and 49 base pairs in length, Tm(°C) = 81.5 + 16.6(log10[Na+]) + 0.41(%G+C) - 
(600/N), where N is the number of bases in the hybrid, and [Na+] is the concentration of sodium ions in the 
hybridization buffer ([Na+] for 1xSSC = 0.165 M). 

Additional examples of stringency conditions for polynucleotide hybridization are 
provided in Sambrook, J., E.F. Fritsch, and T. Maniatis, 1989, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
chapters 9 and 11, and Current Protocols in Molecular Biology, 1995, F.M. Ausubel et al., eds., 
John Wiley & Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. 
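
The melting-temperature rules given in the footnote to the stringency table (for hybrids shorter than 50 base pairs) can be sketched in code as follows. The equations and the value [Na+] = 0.165 M for 1xSSC are taken from the footnote; the function names, and the assumption that the sodium concentration scales linearly with the SSC fold (for example 4xSSC = 0.66 M), are illustrative and are not stated in the text.

```python
import math

SODIUM_MOLAR_1X_SSC = 0.165  # [Na+] for 1xSSC, as stated in the table footnote


def tm_under_18(seq: str) -> float:
    """Tm for hybrids under 18 bp: 2 x (A + T count) + 4 x (G + C count)."""
    s = seq.upper()
    return 2.0 * (s.count("A") + s.count("T")) + 4.0 * (s.count("G") + s.count("C"))


def tm_18_to_49(seq: str, ssc_fold: float = 1.0) -> float:
    """Tm for hybrids of 18 to 49 bp: 81.5 + 16.6*log10[Na+] + 0.41*(%G+C) - 600/N."""
    s = seq.upper()
    n = len(s)
    gc_percent = 100.0 * (s.count("G") + s.count("C")) / n
    sodium = SODIUM_MOLAR_1X_SSC * ssc_fold  # assumption: linear scaling with SSC fold
    return 81.5 + 16.6 * math.log10(sodium) + 0.41 * gc_percent - 600.0 / n


def hybridization_temperature_range(seq: str, ssc_fold: float = 1.0):
    """Return (low, high) in degrees C, i.e. 10 and 5 degrees below the estimated Tm."""
    n = len(seq)
    if n == 0 or n >= 50:
        raise ValueError("these formulas apply only to hybrids of 1 to 49 base pairs")
    tm = tm_under_18(seq) if n < 18 else tm_18_to_49(seq, ssc_fold)
    return (tm - 10.0, tm - 5.0)
```

For a 12-base hybrid with six A/T and six G/C bases, the first rule gives a Tm of 36°C, so the sketch returns a hybridization range of 26 to 31°C.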

Preferably, each such hybridizing polynucleotide has a length that is at least 
25% (more preferably at least 50%, and most preferably at least 75%) of the length of the 
polynucleotide of the present invention to which it hybridizes, and has at least 60% 
sequence identity (more preferably, at least 75% identity; most preferably at least 90% or 
95% identity) with the polynucleotide of the present invention to which it hybridizes, 
where sequence identity is determined by comparing the sequences of the hybridizing 
polynucleotides when aligned so as to maximize overlap and identity while minimizing 
sequence gaps. 

The isolated polynucleotide of the invention may be operably linked to an 
expression control sequence such as the pMT2 or pED expression vectors disclosed in 
Kaufman et al., Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein 
recombinantly. Many suitable expression control sequences are known in the art. General 
methods of expressing recombinant proteins are also known and are exemplified in R. 
Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein "operably 
linked" means that the isolated polynucleotide of the invention and an expression control 
sequence are situated within a vector or cell in such a way that the protein is expressed 
by a host cell which has been transformed (transfected) with the ligated 
polynucleotide/expression control sequence. 

A number of types of cells may act as suitable host cells for expression of the 
protein. Mammalian host cells include, for example, monkey COS cells, Chinese Hamster 
Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 
cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, primary explants, HeLa cells, 
mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. 

Alternatively, it may be possible to produce the protein in lower eukaryotes such 
as yeast or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any 
yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 
strain capable of expressing heterologous proteins. If the protein is made in yeast or 
bacteria, it may be necessary to modify the protein produced therein, for example by 
phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional 
protein. Such covalent attachments may be accomplished using known chemical or 
enzymatic methods. 

The protein may also be produced by operably linking the isolated polynucleotide 
of the invention to suitable control sequences in one or more insect expression vectors, 
and employing an insect expression system. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, 
e.g., Invitrogen, San Diego, California, U.S.A. (the MaxBac® kit), and such methods are 
well known in the art, as described in Summers and Smith, Texas Agricultural Experiment 
Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an 
insect cell capable of expressing a polynucleotide of the present invention is 
"transformed." 

The protein of the invention may be prepared by culturing transformed host cells 
under culture conditions suitable to express the recombinant protein. The resulting 
expressed protein may then be purified from such culture (i.e., from culture medium or 
cell extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity column 
containing agents which will bind to the protein; one or more column steps over such 
affinity resins as concanavalin A-agarose, heparin-toyopearl® or Cibacron blue 3GA 
Sepharose®; one or more steps involving hydrophobic interaction chromatography using 
such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity 
chromatography. 

Alternatively, the protein of the invention may also be expressed in a form which 
will facilitate purification. For example, it may be expressed as a fusion protein, such as 
those of maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin 
(TRX). Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, MA), Pharmacia (Piscataway, NJ) and 
Invitrogen, respectively. The protein can also be tagged with an epitope and 
subsequently purified by using a specific antibody directed to such epitope. One such 
epitope ("Flag") is commercially available from Kodak (New Haven, CT). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
methyl or other aliphatic groups, can be employed to further purify the protein. Some or 
all of the foregoing purification steps, in various combinations, can also be employed to 
provide a substantially homogeneous isolated recombinant protein. The protein thus 
purified is substantially free of other mammalian proteins and is defined in accordance 
with the present invention as an "isolated protein." 

The protein of the invention may also be expressed as a product of transgenic 
animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

The protein may also be produced by known conventional chemical synthesis. 
Methods for constructing the proteins of the present invention by synthetic means are 
known to those skilled in the art. The synthetically-constructed protein sequences, by 
virtue of sharing primary, secondary or tertiary structural and/or conformational 
characteristics with proteins may possess biological properties in common therewith, 
including protein activity. Thus, they may be employed as biologically active or 
immunological substitutes for natural, purified proteins in screening of therapeutic 
compounds and in immunological processes for the development of antibodies. 

The proteins provided herein also include proteins characterized by amino acid 
sequences similar to those of purified proteins but into which modifications are naturally 
provided or deliberately engineered. For example, modifications in the peptide or DNA 
sequences can be made by those skilled in the art using known techniques. Modifications 
of interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For 
example, one or more of the cysteine residues may be deleted or replaced with another 
amino acid to alter the conformation of the molecule. Techniques for such alteration, 
substitution, replacement, insertion or deletion are well known to those skilled in the art 
(see, e.g., U.S. Patent No. 4,518,584). Preferably, such alteration, substitution, replacement, 
insertion or deletion retains the desired activity of the protein. 

Other fragments and derivatives of the sequences of proteins which would be 
expected to retain protein activity in whole or in part and may thus be useful for screening 
or other immunological methodologies may also be easily made by those skilled in the art 
given the disclosures herein. Such modifications are believed to be encompassed by the 
present invention. 

USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit 
one or more of the uses or biological activities (including those associated with assays 
cited herein) identified below. Uses or activities described for proteins of the present 
invention may be provided by administration or use of such proteins or by administration 
or use of polynucleotides encoding such proteins (such as, for example, in gene therapies 
or vectors suitable for introduction of DNA). 

Research Uses and Utilities 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express 
recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
constitutively or at a particular stage of tissue differentiation or development or in disease 
states); as molecular weight markers on Southern gels; as chromosome markers or tags 
(when labeled) to identify chromosomes or to map related gene positions; to compare 
with endogenous DNA sequences in patients to identify potential genetic disorders; as 
probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" 
known sequences in the process of discovering other novel polynucleotides; for selecting 
and making oligomers for attachment to a "gene chip" or other support, including for 
examination of expression patterns; to raise anti-protein antibodies using DNA 
immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another 
immune response. Where the polynucleotide encodes a protein which binds or potentially 
binds to another protein (such as, for example, in a receptor-ligand interaction), the 
polynucleotide can also be used in interaction trap assays (such as, for example, that 
described in Gyuris et al., Cell 75:791-803 (1993)) to identify polynucleotides encoding the 
other protein with which binding occurs or to identify inhibitors of the binding 
interaction. 

The proteins provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high- 
throughput screening; to raise antibodies or to elicit another immune response; as a 
reagent (including the labeled reagent) in assays designed to quantitatively determine 
levels of the protein (or its receptor) in biological fluids; as markers for tissues in which 
the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in a disease state); and, of 
course, to isolate correlative receptors or ligands. Where the protein binds or potentially 
binds to another protein (such as, for example, in a receptor-ligand interaction), the 
protein can be used to identify the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. Proteins involved in these binding interactions can 
also be used to screen for peptide or small molecule inhibitors or agonists of the binding 
interaction. 

Any or all of these research utilities are capable of being developed into reagent 
1 0 grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in 
the art. References disclosing such methods include without limitation "Molecular 
Cloning: A Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook,
J., E.F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to
Molecular Cloning Techniques", Academic Press, Berger, S.L. and A.R. Kimmel eds., 1987.

Nutritional Uses 

Polynucleotides and proteins of the present invention can also be used as 
nutritional sources or supplements. Such uses include without limitation use as a protein 
or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a
source of carbohydrate. In such cases the protein or polynucleotide of the invention can
be added to the feed of a particular organism or can be administered as a separate solid
or liquid preparation, such as in the form of powder, pills, solutions, suspensions or
capsules. In the case of microorganisms, the protein or polynucleotide of the invention
can be added to the medium in or on which the microorganism is cultured.

Cytokine and Cell Proliferation/Differentiation Activity

A protein of the present invention may exhibit cytokine, cell proliferation (either
inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may
induce production of other cytokines in certain cell populations. Many protein factors
discovered to date, including all known cytokines, have exhibited activity in one or more
factor dependent cell proliferation assays, and hence the assays serve as a convenient
confirmation of cytokine activity. The activity of a protein of the present invention is
evidenced by any one of a number of routine factor dependent cell proliferation assays
for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3,
MC9/G, M+ (preB M+), 2E8, RB5, DA1, 123, T1165, HT2, CTLL2, TF-1, Mo7e and CMK.

The activity of a protein of the invention may, among other means, be measured
by the following methods:

Assays for T-cell or thymocyte proliferation include without limitation those
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H.
Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter
7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986;
Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology
133:327-341, 1991; Bertagnolli, et al., J. Immunol. 149:3778-3783, 1992; Bowman et al., J.
Immunol. 152:1756-1761, 1994.

Assays for cytokine production and/or proliferation of spleen cells, lymph node
cells or thymocytes include, without limitation, those described in: Polyclonal T cell
stimulation, Kruisbeek, A.M. and Shevach, E.M. In Current Protocols in Immunology. J.E.e.a.
Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and
Measurement of mouse and human Interferon γ, Schreiber, R.D. In Current Protocols in
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994.

Assays for proliferation and differentiation of hematopoietic and lymphopoietic
cells include, without limitation, those described in: Measurement of Human and Murine
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L.S. and Lipsky, P.E. In Current
Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons,
Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature
336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983;
Measurement of mouse and human interleukin 6 - Nordan, R. In Current Protocols in
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991;
Smith et al., Proc. Natl. Acad. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human
Interleukin 11 - Bennett, P., Giannotti, J., Clark, S.C. and Turner, K.J. In Current Protocols
in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991;
Measurement of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, J., Clark, S.C.
and Turner, K.J. In Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.13.1,
John Wiley and Sons, Toronto. 1991.

Assays for T-cell clone responses to antigens (which will identify, among others,
proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring
proliferation and cytokine production) include, without limitation, those described in:
Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. Margulies,
E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience
(Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and
their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al.,
Proc. Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al., Eur. J. Immun.
11:405-411, 1981; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol.
140:508-512, 1988.

Immune Stimulating or Suppressing Activity 

A protein of the present invention may also exhibit immune stimulating or
immune suppressing activity, including without limitation the activities for which assays
are described herein. A protein may be useful in the treatment of various immune
deficiencies and disorders (including severe combined immunodeficiency (SCID)), e.g.,
in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well
as effecting the cytolytic activity of NK cells and other cell populations. These immune
deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or fungal
infections, or may result from autoimmune disorders. More specifically, infectious
diseases caused by viral, bacterial, fungal or other infection may be treatable using a
protein of the present invention, including infections by HIV, hepatitis viruses,
herpesviruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections
such as candidiasis. Of course, in this regard, a protein of the present invention may also
be useful where a boost to the immune system generally may be desirable, i.e., in the
treatment of cancer.

Autoimmune disorders which may be treated using a protein of the present 
invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, 
Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitus,
myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease.
Such a protein of the present invention may also be useful in the treatment of allergic
reactions and conditions, such as asthma (particularly allergic asthma) or other respiratory
problems. Other conditions, in which immune suppression is desired (including, for
example, organ transplantation), may also be treatable using a protein of the present
invention.

Using the proteins of the invention it may also be possible to regulate immune responses
in a number of ways. Down regulation may be in the form of inhibiting or blocking an
immune response already in progress or may involve preventing the induction of an
immune response. The functions of activated T cells may be inhibited by suppressing T
cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression
of T cell responses is generally an active, non-antigen-specific, process which requires
continuous exposure of the T cells to the suppressive agent. Tolerance, which involves
inducing non-responsiveness or anergy in T cells, is distinguishable from
immunosuppression in that it is generally antigen-specific and persists after exposure to
the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack
of a T cell response upon reexposure to specific antigen in the absence of the tolerizing
agent.

Down regulating or preventing one or more antigen functions (including without
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing
high level lymphokine synthesis by activated T cells, will be useful in situations of tissue,
skin and organ transplantation and in graft-versus-host disease (GVHD). For example,
blockage of T cell function should result in reduced tissue destruction in tissue
transplantation. Typically, in tissue transplants, rejection of the transplant is initiated
through its recognition as foreign by T cells, followed by an immune reaction that destroys
the transplant. The administration of a molecule which inhibits or blocks interaction of
a B7 lymphocyte antigen with its natural ligand(s) on immune cells (such as a soluble,
monomeric form of a peptide having B7-2 activity alone or in conjunction with a
monomeric form of a peptide having an activity of another B lymphocyte antigen (e.g.,
B7-1, B7-3) or blocking antibody), prior to transplantation can lead to the binding of the
molecule to the natural ligand(s) on the immune cells without transmitting the
corresponding costimulatory signal. Blocking B lymphocyte antigen function in this
manner prevents cytokine synthesis by immune cells, such as T cells, and thus acts as an
immunosuppressant. Moreover, the lack of costimulation may also be sufficient to
anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term
tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated
administration of these blocking reagents. To achieve sufficient immunosuppression or
tolerance in a subject, it may also be necessary to block the function of a combination of 
B lymphocyte antigens.

The efficacy of particular blocking reagents in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been
used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as
described in Lenschow et al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad.
Sci. USA, 89:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed.,
Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to
determine the effect of blocking B lymphocyte antigen function in vivo on the development 
of that disease. 

Blocking antigen function may also be therapeutically useful for treating
autoimmune diseases. Many autoimmune disorders are the result of inappropriate
activation of T cells that are reactive against self tissue and which promote the production
of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the
activation of autoreactive T cells may reduce or eliminate disease symptoms.
Administration of reagents which block costimulation of T cells by disrupting
receptor:ligand interactions of B lymphocyte antigens can be used to inhibit T cell
activation and prevent production of autoantibodies or T cell-derived cytokines which
may be involved in the disease process. Additionally, blocking reagents may induce
antigen-specific tolerance of autoreactive T cells which could lead to long-term relief from
the disease. The efficacy of blocking reagents in preventing or alleviating autoimmune
disorders can be determined using a number of well-characterized animal models of
human autoimmune diseases. Examples include murine experimental autoimmune
encephalitis, systemic lupus erythematosus in MRL/lpr/lpr mice or NZB hybrid mice,
murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and
murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven
Press, New York, 1989, pp. 840-856).

Upregulation of an antigen function (preferably a B lymphocyte antigen function),
as a means of up regulating immune responses, may also be useful in therapy.
Upregulation of immune responses may be in the form of enhancing an existing immune
response or eliciting an initial immune response. For example, enhancing an immune
response through stimulating B lymphocyte antigen function may be useful in cases of
viral infection. In addition, systemic viral diseases such as influenza, the common cold,
and encephalitis might be alleviated by the administration of stimulatory forms of B
lymphocyte antigens systemically.

Alternatively, anti-viral immune responses may be enhanced in an infected patient 
by removing T cells from the patient, costimulating the T cells in vitro with viral
antigen-pulsed APCs either expressing a peptide of the present invention or together with a
stimulatory form of a soluble peptide of the present invention and reintroducing the in
vitro activated T cells into the patient. Another method of enhancing anti-viral immune
responses would be to isolate infected cells from a patient, transfect them with a nucleic
acid encoding a protein of the present invention as described herein such that the cells
express all or a portion of the protein on their surface, and reintroduce the transfected
cells into the patient. The infected cells would now be capable of delivering a
costimulatory signal to, and thereby activate, T cells in vivo.

In another application, up regulation or enhancement of antigen function
(preferably B lymphocyte antigen function) may be useful in the induction of tumor
immunity. Tumor cells (e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma,
carcinoma) transfected with a nucleic acid encoding at least one peptide of the present
invention can be administered to a subject to overcome tumor-specific tolerance in the
subject. If desired, the tumor cell can be transfected to express a combination of peptides.
For example, tumor cells obtained from a patient can be transfected ex vivo with an
expression vector directing the expression of a peptide having B7-2-like activity alone, or
in conjunction with a peptide having B7-1-like activity and/or B7-3-like activity. The
transfected tumor cells are returned to the patient to result in expression of the peptides
on the surface of the transfected cell. Alternatively, gene therapy techniques can be used
to target a tumor cell for transfection in vivo.

The presence of the peptide of the present invention having the activity of a B
lymphocyte antigen(s) on the surface of the tumor cell provides the necessary
costimulation signal to T cells to induce a T cell mediated immune response against the
transfected tumor cells. In addition, tumor cells which lack MHC class I or MHC class II
molecules, or which fail to reexpress sufficient amounts of MHC class I or MHC class II
molecules, can be transfected with nucleic acid encoding all or a portion (e.g., a
cytoplasmic-domain truncated portion) of an MHC class I α chain protein and β2
microglobulin protein or an MHC class II α chain protein and an MHC class II β chain
protein to thereby express MHC class I or MHC class II proteins on the cell surface.
Expression of the appropriate class I or class II MHC in conjunction with a peptide having
the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T cell mediated
immune response against the transfected tumor cell. Optionally, a gene encoding an
antisense construct which blocks expression of an MHC class II associated protein, such
as the invariant chain, can also be cotransfected with a DNA encoding a peptide having
the activity of a B lymphocyte antigen to promote presentation of tumor associated
antigens and induce tumor specific immunity. Thus, the induction of a T cell mediated
immune response in a human subject may be sufficient to overcome tumor-specific
tolerance in the subject.

The activity of a protein of the invention may, among other means, be measured
by the following methods:

Suitable assays for thymocyte or splenocyte cytotoxicity include, without
limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M.
Kruisbeek, D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates
and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function
3.1-3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci.
USA 78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al.,
J. Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al.,
J. Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et al.,
Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994.

Assays for T-cell-dependent immunoglobulin responses and isotype switching
(which will identify, among others, proteins that modulate T-cell dependent antibody
responses and that affect Th1/Th2 profiles) include, without limitation, those described
in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro
antibody production, Mond, J.J. and Brunswick, M. In Current Protocols in Immunology.
J.E.e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994.

Mixed lymphocyte reaction (MLR) assays (which will identify, among others,
proteins that generate predominantly Th1 and CTL responses) include, without limitation,
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek,
D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter
7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai
et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992.

Dendritic cell-dependent assays (which will identify, among others, proteins
expressed by dendritic cells that activate naive T-cells) include, without limitation, those
described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of
Experimental Medicine 173:549-559, 1991; Macatonia et al., Journal of Immunology
154:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 182:255-260, 1995;
Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., Science 264:961-965,
1994; Macatonia et al., Journal of Experimental Medicine 169:1255-1264, 1989; Bhardwaj
et al., Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., Journal of
Experimental Medicine 172:631-640, 1990.

Assays for lymphocyte survival/apoptosis (which will identify, among others,
proteins that prevent apoptosis after superantigen induction and proteins that regulate
lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz
et al., Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et
al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk,
Journal of Immunology 145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993;
Gorczyca et al., International Journal of Oncology 1:639-648, 1992.

Assays for proteins that influence early steps of T-cell commitment and
development include, without limitation, those described in: Antica et al., Blood
84:111-117, 1994; Fine et al., Cellular Immunology 155:111-122, 1994; Galy et al., Blood
85:2770-2778, 1995; Toki et al., Proc. Nat. Acad Sci. USA 88:7548-7551, 1991.

Hematopoiesis Regulating Activity 

A protein of the present invention may be useful in regulation of hematopoiesis
and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even
marginal biological activity in support of colony forming cells or of factor-dependent cell
lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth and
proliferation of erythroid progenitor cells alone or in combination with other cytokines,
thereby indicating utility, for example, in treating various anemias or for use in
conjunction with irradiation/chemotherapy to stimulate the production of erythroid
precursors and/or erythroid cells; in supporting the growth and proliferation of myeloid
cells such as granulocytes and monocytes/macrophages (i.e., traditional CSF activity)
useful, for example, in conjunction with chemotherapy to prevent or treat consequent
myelo-suppression; in supporting the growth and proliferation of megakaryocytes and
consequently of platelets thereby allowing prevention or treatment of various platelet
disorders such as thrombocytopenia, and generally for use in place of or complementary
to platelet transfusions; and/or in supporting the growth and proliferation of
hematopoietic stem cells which are capable of maturing to any and all of the
above-mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell
disorders (such as those usually treated with transplantation, including, without
limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in
repopulating the stem cell compartment post irradiation/chemotherapy, either in-vivo or
ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral
progenitor cell transplantation (homologous or heterologous)) as normal cells or
genetically manipulated for gene therapy.

The activity of a protein of the invention may, among other means, be measured
by the following methods: 

Suitable assays for proliferation and differentiation of various hematopoietic lines
are cited above.

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et al., Cellular Biology 15:141-151, 1995; Keller et
al., Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood
81:2903-2915, 1993.

Assays for stem cell survival and differentiation (which will identify, among
others, proteins that regulate lympho-hematopoiesis) include, without limitation, those
described in: Methylcellulose colony forming assays, Freshney, M.G. In Culture of
Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York,
NY. 1994; Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; Primitive
hematopoietic colony forming cells with high proliferative potential, McNiece, I.K. and
Briddell, R.A. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 23-39,
Wiley-Liss, Inc., New York, NY. 1994; Neben et al., Experimental Hematology 22:353-359,
1994; Cobblestone area forming cell assay, Ploemacher, R.E. In Culture of Hematopoietic
Cells. R.I. Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, NY. 1994; Long
term bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and
Allen, T. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 163-179,
Wiley-Liss, Inc., New York, NY. 1994; Long term culture initiating cell assay, Sutherland,
H.J. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 139-162, Wiley-Liss,
Inc., New York, NY. 1994.



Tissue Growth Activity 

A protein of the present invention also may have utility in compositions used for 
bone, cartilage, tendon, ligament and/or nerve tissue growth or regeneration, as well as
for wound healing and tissue repair and replacement, and in the treatment of burns,
incisions and ulcers. 

A protein of the present invention, which induces cartilage and/or bone growth 
in circumstances where bone is not normally formed, has application in the healing of 
bone fractures and cartilage damage or defects in humans and other animals. Such a 
preparation employing a protein of the invention may have prophylactic use in closed as
well as open fracture reduction and also in the improved fixation of artificial joints. De 
novo bone formation induced by an osteogenic agent contributes to the repair of 
congenital, trauma induced, or oncologic resection induced craniofacial defects, and also 
is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the treatment of periodontal
disease, and in other tooth repair processes. Such agents may provide an environment
to attract bone-forming cells, stimulate growth of bone-forming cells or induce
differentiation of progenitors of bone-forming cells. A protein of the invention may also
be useful in the treatment of osteoporosis or osteoarthritis, such as through stimulation
of bone and/or cartilage repair or by blocking inflammation or processes of tissue
destruction (collagenase activity, osteoclast activity, etc.) mediated by inflammatory
processes.

Another category of tissue regeneration activity that may be attributable to the 
protein of the present invention is tendon/ligament formation. A protein of the present 
invention, which induces tendon/ligament-like tissue or other tissue formation in 
circumstances where such tissue is not normally formed, has application in the healing of 
tendon or ligament tears, deformities and other tendon or ligament defects in humans and
other animals. Such a preparation employing a tendon/ligament-like tissue inducing
protein may have prophylactic use in preventing damage to tendon or ligament tissue, as
well as use in the improved fixation of tendon or ligament to bone or other tissues, and
in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue
formation induced by a composition of the present invention contributes to the repair of
congenital, trauma induced, or other tendon or ligament defects of other origin, and is also
useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The
compositions of the present invention may provide an environment to attract tendon- or
ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce
differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of
tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair. The
compositions of the invention may also be useful in the treatment of tendinitis, carpal
tunnel syndrome and other tendon or ligament defects. The compositions may also
include an appropriate matrix and/or sequestering agent as a carrier as is well known in
the art.

The protein of the present invention may also be useful for proliferation of neural 
cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and 
peripheral nervous system diseases and neuropathies, as well as mechanical and
traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve
tissue. More specifically, a protein may be used in the treatment of diseases of the
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and
localized neuropathies, and central nervous system diseases, such as Alzheimer's,
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager
syndrome. Further conditions which may be treated in accordance with the present
invention include mechanical and traumatic disorders, such as spinal cord disorders, head
trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other medical therapies may also be treatable using a protein of the 
invention. 

Proteins of the invention may also be useful to promote better or faster closure of
non-healing wounds, including without limitation pressure ulcers, ulcers associated with
vascular insufficiency, surgical and traumatic wounds, and the like.

It is expected that a protein of the present invention may also exhibit activity for 
generation or regeneration of other tissues, such as organs (including, for example, 
pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac)
and vascular (including vascular endothelium) tissue, or for promoting the growth of cells
comprising such tissues. Part of the desired effects may be by inhibition or modulation
of fibrotic scarring to allow normal tissue to regenerate. A protein of the invention may 
also exhibit angiogenic activity. 



A protein of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues,
and conditions resulting from systemic cytokine damage. 

A protein of the present invention may also be useful for promoting or inhibiting
differentiation of tissues described above from precursor tissues or cells; or for inhibiting
the growth of tissues described above. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods:

Assays for tissue generation activity include, without limitation, those described 
in: International Patent Publication No. WO95/16035 (bone, cartilage, tendon); 
International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent 
Publication No. WO91/07491 (skin, endothelium).

Assays for wound healing activity include, without limitation, those described in: 
Epidermal Wound Healing, pps. 71-112 (Maibach, HI and Rovee, DT, eds.), Year
Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest.
Dermatol. 71:382-84 (1978).



Activin/Inhibin Activity 

A protein of the present invention may also exhibit activin- or inhibin-related 
activities. Inhibins are characterized by their ability to inhibit the release of follicle
stimulating hormone (FSH), while activins are characterized by their ability to
stimulate the release of follicle stimulating hormone (FSH). Thus, a protein of the present
invention, alone or in heterodimers with a member of the inhibin α family, may be useful
as a contraceptive based on the ability of inhibins to decrease fertility in female mammals
and decrease spermatogenesis in male mammals. Administration of sufficient amounts
of other inhibins can induce infertility in these mammals. Alternatively, the protein of the
invention, as a homodimer or as a heterodimer with other protein subunits of the
inhibin-β group, may be useful as a fertility inducing therapeutic, based upon the ability of activin
molecules in stimulating FSH release from cells of the anterior pituitary. See, for example,
United States Patent 4,798,885. A protein of the invention may also be useful for
advancement of the onset of fertility in sexually immature mammals, so as to increase the
lifetime reproductive performance of domestic animals such as cows, sheep and pigs.

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for activin/inhibin activity include, without limitation, those described in: 
Vale et al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et 
al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. 
Natl. Acad. Sci. USA 83:3091-3095, 1986. 


Chemotactic/Chemokinetic Activity 

A protein of the present invention may have chemotactic or chemokinetic activity 
(e.g., act as a chemokine) for mammalian cells, including, for example, monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. 
Chemotactic and chemokinetic proteins can be used to mobilize or attract a desired cell 
population to a desired site of action. Chemotactic or chemokinetic proteins provide 
particular advantages in treatment of wounds and other trauma to tissues, as well as in 
treatment of localized infections. For example, attraction of lymphocytes, monocytes or 
neutrophils to tumors or sites of infection may result in improved immune responses 
against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it 
can stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 
movement of cells. Whether a particular protein has chemotactic activity for a population 
of cells can be readily determined by employing such protein or peptide in any known 
assay for cell chemotaxis. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 
chemotaxis) consist of assays that measure the ability of a protein to induce the migration 
of cells across a membrane as well as the ability of a protein to induce the adhesion of one 
cell population to another cell population. Suitable assays for movement and adhesion 
include, without limitation, those described in: Current Protocols in Immunology, Ed by 
J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W. Strober, Pub. Greene 
Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and 
beta Chemokines 6.12.1-6.12.28); Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. 
APMIS 103:140-146, 1995; Muller et al. Eur. J. Immunol. 25: 1744-1748; Gruber et al. J. of 
Immunol. 152:5860-5867, 1994; Johnston et al. J. of Immunol. 153: 1762-1768, 1994. 

Hemostatic and Thrombolytic Activity 

A protein of the invention may also exhibit hemostatic or thrombolytic activity. 
As a result, such a protein is expected to be useful in treatment of various coagulation 
disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation 
and other hemostatic events in treating wounds resulting from trauma, surgery or other 
causes. A protein of the invention may also be useful for dissolving or inhibiting 
formation of thromboses and for treatment and prevention of conditions resulting 
therefrom (such as, for example, infarction of cardiac and central nervous system vessels 
(e.g., stroke)). 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis 
Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 
35:467-474, 1988. 



Receptor/Ligand Activity 

A protein of the present invention may also demonstrate activity as receptors, 
receptor ligands or inhibitors or agonists of receptor/ligand interactions. Examples of 
such receptors and ligands include, without limitation, cytokine receptors and their 
ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 
receptors involved in cell-cell interactions and their ligands (including without limitation, 
cellular adhesion molecules (such as selectins, integrins and their ligands) and 
receptor/ligand pairs involved in antigen presentation, antigen recognition and 
development of cellular and humoral immune responses). Receptors and ligands are also 
useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without 
limitation, fragments of receptors and ligands) may itself be useful as an inhibitor of 
receptor/ligand interactions. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those 
described in: Current Protocols in Immunology, Ed by J.E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E.M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static 
conditions 7.28.1-7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; 
Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160, 
1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661-670, 
1995. 

Anti-Inflammatory Activity 

Proteins of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 
the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such activities can be used to treat 
inflammatory conditions (including chronic or acute conditions), including without 
limitation inflammation associated with infection (such as septic shock, sepsis or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting 
from over production of cytokines such as TNF or IL-1. Proteins of the invention may also 
be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 

Cadherin/Tumor Invasion Suppressor Activity 

Cadherins are calcium-dependent adhesion molecules that appear to play major 
roles during development, particularly in defining specific cell types. Loss or alteration 
of normal cadherin expression can lead to changes in cell adhesion properties linked to 
tumor growth and metastasis. Cadherin malfunction is also implicated in other human 
diseases, such as pemphigus vulgaris and pemphigus foliaceus (auto-immune blistering 
skin diseases), Crohn's disease, and some developmental abnormalities. 

The cadherin superfamily includes well over forty members, each with a distinct 
pattern of expression. All members of the superfamily have in common conserved 
extracellular repeats (cadherin domains), but structural differences are found in other 
parts of the molecule. The cadherin domains bind calcium to form their tertiary structure 
and thus calcium is required to mediate their adhesion. Only a few amino acids in the 
first cadherin domain provide the basis for homophilic adhesion; modification of this 
recognition site can change the specificity of a cadherin so that instead of recognizing only 
itself, the mutant molecule can now also bind to a different cadherin. In addition, some 
cadherins engage in heterophilic adhesion with other cadherins. 

E-cadherin, one member of the cadherin superfamily, is expressed in epithelial cell 
types. Pathologically, if E-cadherin expression is lost in a tumor, the malignant cells 
become invasive and the cancer metastasizes. Transfection of cancer cell lines with 
polynucleotides expressing E-cadherin has reversed cancer-associated changes by 
returning altered cell shapes to normal, restoring cells' adhesiveness to each other and to 
their substrate, decreasing the cell growth rate, and drastically reducing anchorage-independent 
cell growth. Thus, reintroducing E-cadherin expression reverts carcinomas 
to a less advanced stage. It is likely that other cadherins have the same invasion 
suppressor role in carcinomas derived from other tissue types. Therefore, proteins of the 
present invention with cadherin activity, and polynucleotides of the present invention 
encoding such proteins, can be used to treat cancer. Introducing such proteins or 
polynucleotides into cancer cells can reduce or eliminate the cancerous changes observed 
in these cells by providing normal cadherin expression. 

Cancer cells have also been shown to express cadherins of a different tissue type 
than their origin, thus allowing these cells to invade and metastasize in a different tissue 
in the body. Proteins of the present invention with cadherin activity, and polynucleotides 
of the present invention encoding such proteins, can be substituted in these cells for the 
inappropriately expressed cadherins, restoring normal cell adhesive properties and 
reducing or eliminating the tendency of the cells to metastasize. 

Additionally, proteins of the present invention with cadherin activity, and 
polynucleotides of the present invention encoding such proteins, can be used to generate 
antibodies recognizing and binding to cadherins. Such antibodies can be used to block 
the adhesion of inappropriately expressed tumor-cell cadherins, preventing the cells from 
forming a tumor elsewhere. Such an anti-cadherin antibody can also be used as a marker 
for the grade, pathological type, and prognosis of a cancer, i.e. the more progressed the 
cancer, the less cadherin expression there will be, and this decrease in cadherin expression 
can be detected by the use of a cadherin-binding antibody. 

Fragments of proteins of the present invention with cadherin activity, preferably 
a polypeptide comprising a decapeptide of the cadherin recognition site, and 
polynucleotides of the present invention encoding such protein fragments, can also be used 
to block cadherin function by binding to cadherins and preventing them from binding in 
ways that produce undesirable effects. Additionally, fragments of proteins of the present 
invention with cadherin activity, preferably truncated soluble cadherin fragments which 
have been found to be stable in the circulation of cancer patients, and polynucleotides 
encoding such protein fragments, can be used to disturb proper cell-cell adhesion. 

Assays for cadherin adhesive and invasive suppressor activity include, without 
limitation, those described in: Hortsch et al. J Biol Chem 270 (32): 18809-18817, 1995; 
Miyaki et al. Oncogene 11: 2547-2552, 1995; Ozawa et al. Cell 63: 1033-1038, 1990. 

Tumor Inhibition Activity 

In addition to the activities described above for immunological treatment or 
prevention of tumors, a protein of the invention may exhibit other anti-tumor activities. 
A protein may inhibit tumor growth directly or indirectly (such as, for example, via 
ADCC). A protein may exhibit its tumor inhibitory activity by acting on tumor tissue or 
tumor precursor tissue, by inhibiting formation of tissues necessary to support tumor 
growth (such as, for example, by inhibiting angiogenesis), by causing production of other 
factors, agents or cell types which inhibit tumor growth, or by suppressing, eliminating 
or inhibiting factors, agents or cell types which promote tumor growth. 

Other Activities 

A protein of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious 
agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting 
(suppressing or enhancing) bodily characteristics, including, without limitation, height, 
weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ 
or body part size or shape (such as, for example, breast augmentation or diminution, 
change in bone form or shape); effecting biorhythms or circadian cycles or rhythms; 
effecting the fertility of male or female subjects; effecting the metabolism, catabolism, 
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, 
carbohydrate, vitamins, minerals, cofactors or other nutritional factors or component(s); 
effecting behavioral characteristics, including, without limitation, appetite, libido, stress, 
cognition (including cognitive disorders), depression (including depressive disorders) and 
violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies 
of the enzyme and treating deficiency-related diseases; treatment of hyperproliferative 
disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for 
example, the ability to bind antigens or complement); and the ability to act as an antigen 
in a vaccine composition to raise an immune response against such protein or another 
material or entity which is cross-reactive with such protein. 

ADMINISTRATION AND DOSING 

A protein of the present invention (from whatever source derived, including 
without limitation from recombinant and non-recombinant sources) may be used in a 
pharmaceutical composition when combined with a pharmaceutically acceptable carrier. 
Such a composition may also contain (in addition to protein and a carrier) diluents, fillers, 
salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of 
the invention may also contain cytokines, lymphokines, or other hematopoietic factors 
such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, 
IL-12, IL-13, IL-14, IL-15, IFN, TNF0, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem 
cell factor, and erythropoietin. The pharmaceutical composition may further contain other 
agents which either enhance the activity of the protein or complement its activity or use 
in treatment. Such additional factors and/or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein of the invention, 
or to minimize side effects. Conversely, protein of the present invention may be included 
in formulations of the particular cytokine, lymphokine, other hematopoietic factor, 
thrombolytic or anti-thrombotic factor, or anti-inflammatory agent to minimize side effects 
of the cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti-inflammatory agent. 

A protein of the present invention may be active in multimers (e.g., heterodimers 
or homodimers) or complexes with itself or other proteins. As a result, pharmaceutical 
compositions of the invention may comprise a protein of the invention in such multimeric 
or complexed form. 

The pharmaceutical composition of the invention may be in the form of a complex 
of the protein(s) of present invention along with protein or peptide antigens. The protein 
and/or peptide antigen will deliver a stimulatory signal to both B and T lymphocytes. B 
lymphocytes will respond to antigen through their surface immunoglobulin receptor. T 
lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins 
including those encoded by class I and class II MHC genes on host cells will serve to 
present the peptide antigen(s) to T lymphocytes. The antigen components could also be 
supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules that 
can directly signal T cells. Alternatively antibodies able to bind surface immunoglobulin 
and other molecules on B cells as well as antibodies able to bind the TCR and other 
molecules on T cells can be combined with the pharmaceutical composition of the 
invention. 

The pharmaceutical composition of the invention may be in the form of a liposome 
in which protein of the present invention is combined, in addition to other 
pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist 
in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers 
in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, 
monoglycerides, diglycerides, sulfatides, lysolecithin, phospholipids, saponin, bile acids, 
and the like. Preparation of such liposomal formulations is within the level of skill in the 
art, as disclosed, for example, in U.S. Patent No. 4,235,871; U.S. Patent No. 4,501,728; U.S. 
Patent No. 4,837,028; and U.S. Patent No. 4,737,323, all of which are incorporated herein 
by reference. 

As used herein, the term "therapeutically effective amount" means the total 
amount of each active component of the pharmaceutical composition or method that is 
sufficient to show a meaningful patient benefit, i.e., treatment, healing, prevention or 
amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 
ingredient, administered alone, the term refers to that ingredient alone. When applied to 
a combination, the term refers to combined amounts of the active ingredients that result 
in the therapeutic effect, whether administered in combination, serially or simultaneously. 

In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein of the present invention is administered to a 
mammal having a condition to be treated. Protein of the present invention may be 
administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines 
or other hematopoietic factors. When co-administered with one or more cytokines, 
lymphokines or other hematopoietic factors, protein of the present invention may be 
administered either simultaneously with the cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If 
administered sequentially, the attending physician will decide on the appropriate 
sequence of administering protein of the present invention in combination with 
cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic 
factors. 

Administration of protein of the present invention used in the pharmaceutical 
composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 
cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. 
Intravenous administration to the patient is preferred. 

When a therapeutically effective amount of protein of the present invention is 
administered orally, protein of the present invention will be in the form of a tablet, 
capsule, powder, solution or elixir. When administered in tablet form, the pharmaceutical 
composition of the invention may additionally contain a solid carrier such as a gelatin or 
an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% protein of the 
present invention, and preferably from about 25 to 90% protein of the present invention. 
When administered in liquid form, a liquid carrier such as water, petroleum, oils of animal 
or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils 
may be added. The liquid form of the pharmaceutical composition may further contain 
physiological saline solution, dextrose or other saccharide solution, or glycols such as 
ethylene glycol, propylene glycol or polyethylene glycol. When administered in liquid 
form, the pharmaceutical composition contains from about 0.5 to 90% by weight of protein 
of the present invention, and preferably from about 1 to 50% protein of the present 
invention. 

When a therapeutically effective amount of protein of the present invention is 
administered by intravenous, cutaneous or subcutaneous injection, protein of the present 
invention will be in the form of a pyrogen-free, parenterally acceptable aqueous solution. 
The preparation of such parenterally acceptable protein solutions, having due regard to 
pH, isotonicity, stability, and the like, is within the skill in the art. A preferred 
pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should 
contain, in addition to protein of the present invention, an isotonic vehicle such as Sodium 
Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, Lactated Ringer's Injection, or other vehicle as known in the art. The 
pharmaceutical composition of the present invention may also contain stabilizers, 
preservatives, buffers, antioxidants, or other additives known to those of skill in the art. 

The amount of protein of the present invention in the pharmaceutical composition 
of the present invention will depend upon the nature and severity of the condition being 
treated, and on the nature of prior treatments which the patient has undergone. 

Ultimately, the attending physician will decide the amount of protein of the present 
invention with which to treat each individual patient. Initially, the attending physician 
will administer low doses of protein of the present invention and observe the patient's 
response. Larger doses of protein of the present invention may be administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 
increased further. It is contemplated that the various pharmaceutical compositions used 
to practice the method of the present invention should contain about 0.01 µg to about 100 
mg (preferably about 0.1 ng to about 10 mg, more preferably about 0.1 µg to about 1 mg) 
of protein of the present invention per kg body weight. 
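
The per-kilogram ranges above scale linearly with body weight. The following is a minimal sketch of that arithmetic only, not a dosing tool: it assumes the lower bounds of the broad and "more preferred" ranges are in micrograms and the upper bounds in milligrams, and the names `DOSE_RANGES_UG_PER_KG` and `dose_bounds_ug` are hypothetical. The intermediate "preferred" range is left out because its lower unit is ambiguous in this copy.

```python
UG_PER_MG = 1000.0  # micrograms per milligram

# (low, high) bounds in micrograms of protein per kg body weight.
# Assumption: lower bounds are read as micrograms, upper bounds as milligrams.
DOSE_RANGES_UG_PER_KG = {
    "broad": (0.01, 100.0 * UG_PER_MG),
    "more_preferred": (0.1, 1.0 * UG_PER_MG),
}


def dose_bounds_ug(body_weight_kg, tier="broad"):
    """Return (low, high) total protein amounts in micrograms for one subject."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = DOSE_RANGES_UG_PER_KG[tier]
    return low * body_weight_kg, high * body_weight_kg


if __name__ == "__main__":
    for tier in DOSE_RANGES_UG_PER_KG:
        low, high = dose_bounds_ug(70.0, tier)
        print(f"{tier}: {low:g} ug to {high / UG_PER_MG:g} mg for a 70 kg subject")
```

The computed bounds span several orders of magnitude, which is consistent with the text leaving the actual amount to the attending physician.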

The duration of intravenous therapy using the pharmaceutical composition of the 
present invention will vary, depending on the severity of the disease being treated and 
the condition and potential idiosyncratic response of each individual patient. It is 
contemplated that the duration of each application of the protein of the present invention 
will be in the range of 12 to 24 hours of continuous intravenous administration. 
Ultimately the attending physician will decide on the appropriate duration of intravenous 
therapy using the pharmaceutical composition of the present invention. 

Protein of the invention may also be used to immunize animals to obtain 
polyclonal and monoclonal antibodies which specifically react with the protein. Such 
antibodies may be obtained using either the entire protein or fragments thereof as an 
immunogen. The peptide immunogens additionally may contain a cysteine residue at the 
carboxyl terminus, and are conjugated to a hapten such as keyhole limpet hemocyanin 
(KLH). Methods for synthesizing such peptides are known in the art, for example, as in 
R.P. Merrifield, J. Amer. Chem. Soc. 85, 2149-2154 (1963); J.L. Krstenansky, et al., FEBS Lett. 
211, 10 (1987). Monoclonal antibodies binding to the protein of the invention may be 
useful diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells, neutralizing monoclonal antibodies against the protein may be useful in detecting 
and preventing the metastatic spread of the cancerous cells, which may be mediated by 
the protein. 

For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the 
composition topically, systemically, or locally as an implant or device. When 
administered, the therapeutic composition for use in this invention is, of course, in a 
pyrogen-free, physiologically acceptable form. Further, the composition may desirably 
be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or 
tissue damage. Topical administration may be suitable for wound healing and tissue 
repair. Therapeutically useful agents other than a protein of the invention, which may also 
optionally be included in the composition as described above, may alternatively or 
additionally be administered simultaneously or sequentially with the composition in the 
methods of the invention. Preferably for bone and/or cartilage formation, the 
composition would include a matrix capable of delivering the protein-containing 
composition to the site of bone and/or cartilage damage, providing a structure for the 
developing bone and cartilage and optimally capable of being resorbed into the body. 
Such matrices may be formed of materials presently in use for other implanted medical 
applications. 

The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate formulation. Potential 
matrices for the compositions may be biodegradable and chemically defined calcium 
sulfate, tricalciumphosphate, hydroxyapatite, polylactic acid, polyglycolic acid and 
polyanhydrides. Other potential materials are biodegradable and biologically 
well-defined, such as bone or dermal collagen. Further matrices are comprised of pure proteins 
or extracellular matrix components. Other potential matrices are nonbiodegradable and 
chemically defined, such as sintered hydroxyapatite, bioglass, aluminates, or other 
ceramics. Matrices may be comprised of combinations of any of the above mentioned 
types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalciumphosphate. The bioceramics may be altered in composition, such as in 
calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. 

Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and glycolic 
acid in the form of porous particles having diameters ranging from 150 to 800 microns. 
In some applications, it will be useful to utilize a sequestering agent, such as 
carboxymethyl cellulose or autologous blood clot, to prevent the protein compositions 
from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, 
hydroxypropylmethylcellulose, and carboxymethylcellulose, the most preferred being cationic salts of 
carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic 
acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl 
polymer and poly(vinyl alcohol). The amount of sequestering agent useful herein is 0.5-20 
wt%, preferably 1-10 wt% based on total formulation weight, which represents the 
amount necessary to prevent desorption of the protein from the polymer matrix and to 
provide appropriate handling of the composition, yet not so much that the progenitor cells 
are prevented from infiltrating the matrix, thereby providing the protein the opportunity 
to assist the osteogenic activity of the progenitor cells. 
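
Because the sequestering agent is specified as a weight percentage of the total formulation, the mass to add follows directly from the batch weight. A minimal sketch of that calculation is given below; the function names and the choice of grams as the unit are illustrative assumptions, and only the 0.5-20 wt% and 1-10 wt% limits come from the text.

```python
USEFUL_WT_PERCENT = (0.5, 20.0)     # useful range stated in the text
PREFERRED_WT_PERCENT = (1.0, 10.0)  # preferred range stated in the text


def sequestering_agent_mass(total_formulation_g, wt_percent):
    """Mass of sequestering agent (g) at wt_percent of total formulation weight."""
    low, high = USEFUL_WT_PERCENT
    if not low <= wt_percent <= high:
        raise ValueError("wt% is outside the 0.5-20 wt% useful range")
    return total_formulation_g * wt_percent / 100.0


def in_preferred_range(wt_percent):
    """True if wt_percent falls within the preferred 1-10 wt% range."""
    low, high = PREFERRED_WT_PERCENT
    return low <= wt_percent <= high


if __name__ == "__main__":
    # Hypothetical 10 g batch formulated at 5 wt% sequestering agent.
    print(sequestering_agent_mass(10.0, 5.0), in_preferred_range(5.0))
```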

In further compositions, proteins of the invention may be combined with other 
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth factor 
(EGF), platelet derived growth factor (PDGF), transforming growth factors (TGF-α and 
TGF-β), and insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary 
applications. Particularly domestic animals and thoroughbred horses, in addition to 
humans, are desired patients for such treatment with proteins of the present invention. 

The dosage regimen of a protein-containing pharmaceutical composition to be 
used in tissue regeneration will be determined by the attending physician considering 
various factors which modify the action of the proteins, e.g., amount of tissue weight 
desired to be formed, the site of damage, the condition of the damaged tissue, the size of 
a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the severity 
of any infection, time of administration and other clinical factors. The dosage may vary 
with the type of matrix used in the reconstitution and with inclusion of other proteins in 
the pharmaceutical composition. For example, the addition of other known growth 
factors, such as IGF I (insulin like growth factor I), to the final composition, may also affect 
the dosage. Progress can be monitored by periodic assessment of tissue/bone growth 
and/or repair, for example, X-rays, histomorphometric determinations and tetracycline 
labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, without 
limitation, in the form of viral vectors or naked DNA). 

Cells may also be cultured ex vivo in the presence of proteins of the present 
invention in order to proliferate or to produce a desired effect on or activity in such cells. 
Treated cells can then be introduced in vivo for therapeutic purposes. 






Patent and literature references cited herein are incorporated by reference as if 
fully set forth. 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Jacobs, Kenneth 
McCoy, John M. 
LaVallie, Edward R. 
Racie, Lisa A. 
Merberg, David 
Treacy, Maurice 
Spaulding, Vikki 
Agostino, Michael 

(ii) TITLE OF INVENTION: SECRETED PROTEINS AND POLYNUCLEOTIDES 
ENCODING THEM 

(iii) NUMBER OF SEQUENCES: 39 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute, Inc. 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Sprunger, Suzanne A. 

(B) REGISTRATION NUMBER: 41,323 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 498-8284 

(B) TELEFAX: (617) 876-5851 



(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

ACGCTGACCA TCACAGGCAC ACAGAGGCAC ATCCACCTCA CATCCACGTC ATACTTGTGT 60 

ACTCTCAGGG TTCAGTCTTT CATCCTATCC CTCTCTGATC TGTGCCTCCC AATACCTTCC 120 

AAGATGTTTA CAGAGACCCT TCTCCCTGTG CAGTTAGGAG TGTAAGGCAA GAGAGCCCCT 180 

ACTTCATGGG GCAGATCAAG AGCTGAGACC AAAGATGGTC TATGTTGCTG ACCTTGTCCT 240 

GTCCTCCTGC TGTCTTAAAC TATGATCCCT GCTGCGGTCA CTGAAGCCTT TCCCTGTGAG 300 

CAGTGGTGTG TGAGAGCCAG GCGTCCCTCT GCCTGCCCAC TCAGTGGCAA CACCCGGGAG 360 

CTGTTTTGTC CTTTGTGGAG CCTCAGCAGT TCCCTCTTTC AGAACTCACT GCCAAGAGCC 420 

CTGAACAGGA GCCACCATGC AGTGCTTCAG CTTCATTAAG ACCATGATGA TCCTCTTCAA 480 

TTTGCTCATC TTTCTGTGTG GTGCAGCCCT GTTGGCAGTG GGCATCTGGG TGTCAATCGA 540 

TGGGGCATCC TTTCTGAAGA TCTTCGGGCC ACTGTCGTCC AGTGCCATGC AGTTTGTCAA 600 

CGTGGGCTAC TTCCTCATCG CAGCCGGCGT TGTGGTCTTT GCTCTTGGTT TCCTGGGCTG 660 

CTATGGTGCT AAGACTGAGA GCAAGTGTGC CCTCGTGACG TTCTTCTTCA TCCTCCTCCT 720 

CATCTTCATT GCTGAGGTTG CAGCTGCTGT GGTCGCCTTG GTGTACACCA CAATGGCTGA 780 

GCACTTCCTG ACGTTGCTGG TAGTGCCTGC CATCAAGAAA GATTATGGTT CCCAGGAAGA 840 

CTTCACTCAA GTGTGGAACA CCACCATGAA AGGGCTCAAG TGCTGTGGCT TCACCAACTA 900 

TACGGATTTT GAGGACTCAC CCTACTTCAA AGAGAACAGT GCCTTTCCCC CATTCTGTTG 960 

CAATGACAAC GTCACCAACA CAGCCAATGA AACCTGCACC AAGCAAAAGG CTCACGACCA 1020 

AAAAGTAGAG GGTTGCTTCA ATCAGCTTTT GTATGACATC CGAACTAATG CAGTCACCGT 1080 

GGGTGGTGTG GCAGCTGGAA TTGGGGGCCT CGAGCTGGCT GCCATGATTG TGTCCATGTA 1140 

TCTGTACTGC AATCTACAAT AAGTCCACTT CTGCCTCTGC CACTACTGCT GCCACATGGG 1200 

AACTGTGAAG AGGCACCCTG GCAAGCAGCA GTGATTGGGG GAGGGGACAG GATCTAACAA 1260 

TGTCACTTGG GCCAGAATGG ACCTGCCCTT TCTGCTCCAG ACTTGGGGGT AGATAGGGAC 1320 

CACTCCTTTT AGGCGATGCC TGACTTTCCT TCCATTGGTG GGTGGATGGG TGGGGGGCAT 1380 

TCCAGAGCCT CTAAGGTAGC CAGTTCTGTT GCCCATTCCC CCAGTCTATT AAACCCTTGA 1440 

TATGCCCCCT AGGCCTAGTG GTGATCCCAG TGCTCTACTG GGGGATGAGA GAAAGGCATT 1500 

TTATAGCCTG GGCATAAGTG AAATCAGCAG AGCCTCTGGG TGGATGTGTA GAAGGCACTT 1560 
CAAAATGCAT AAACCTGTTA CAATGTTGAA AAAAAAAAAA AAAAA 1605 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Gln Cys Phe Ser Phe Ile Lys Thr Met Met Ile Leu Phe Asn Leu 
1 5 10 15 

Leu Ile Phe Leu Cys Gly Ala Ala Leu Leu Ala Val Gly Ile Trp Val 
20 25 30 

Ser Ile Asp Gly Ala Ser Phe Leu Lys Ile Phe Gly Pro Leu Ser Ser 
35 40 45 

Ser Ala Met Gln Phe Val Asn Val Gly Tyr Phe Leu Ile Ala Ala Gly 
50 55 60 

Val Val Val Phe Ala Leu Gly Phe Leu Gly Cys Tyr Gly Ala Lys Thr 
65 70 75 80 

Glu Ser Lys Cys Ala Leu Val Thr Phe Phe Phe Ile Leu Leu Leu Ile 
85 90 95 

Phe Ile Ala Glu Val Ala Ala Ala Val Val Ala Leu Val Tyr Thr Thr 
100 105 110 

Met Ala Glu His Phe Leu Thr Leu Leu Val Val Pro Ala Ile Lys Lys 
115 120 125 

Asp Tyr Gly Ser Gln Glu Asp Phe Thr Gln Val Trp Asn Thr Thr Met 
130 135 140 

Lys Gly Leu Lys Cys Cys Gly Phe Thr Asn Tyr Thr Asp Phe Glu Asp 
145 150 155 160 

Ser Pro Tyr Phe Lys Glu Asn Ser Ala Phe Pro Pro Phe Cys Cys Asn 
165 170 175 

Asp Asn Val Thr Asn Thr Ala Asn Glu Thr Cys Thr Lys Gln Lys Ala 
180 185 190 

His Asp Gln Lys Val Glu Gly Cys Phe Asn Gln Leu Leu Tyr Asp Ile 
195 200 205 

Arg Thr Asn Ala Val Thr Val Gly Gly Val Ala Ala Gly Ile Gly Gly 
210 215 220 

Leu Glu Leu Ala Ala Met Ile Val Ser Met Tyr Leu Tyr Cys Asn Leu 
225 230 235 240 

Gln 



(2) INFORMATION FOR SEQ ID NO:3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CAACCTCGTG GTCACCGCAC CGGGGCTGAT CAAGGGTGAC GCCTGCTTCA CATCTCTAAT 60 

GAACACCCTC ATGACGTCGC TACCAGCACT AGTGCAGCAA CAGGGAAGGC TGCTTCTGGC 120 

TGCTAATGTG GCCACCCTGG GGCTCGTCAT GGCCCGGCTC CTTAGCACCT CTCCAGCTCT 180 

TCAGGGAACA CCAGCATCCC GAGGGTTCTT CGCAGCTGCC ATCCTCTTCC TATCACAGTC 240 

CCACGTGGCG CGGGCCACCC CGGGCTCAGA CCAGGCAGTG CTAGCCCTGT CCCCTGAGTA 300 

TGAGGGCATC TGGGCCGACC TGCAGGAGCT CTGGTTCCTG GGCATNCAAG CCTTCACCGG 360 

CTGTGTGCCT CTGCTGC 377 


(2) INFORMATION FOR SEQ ID NO: 4: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Asn Thr Leu Met Thr Ser Leu Pro Ala Leu Val Gln Gln Gln Gly 
1 5 10 15 

Arg Leu Leu Leu Ala Ala Asn Val Ala Thr Leu Gly Leu Leu Met Ala 
20 25 30 

Arg Leu Leu Ser Thr Ser Pro Ala Leu Gln Gly Thr Pro Ala Ser Arg 
35 40 45 

Gly Phe Phe Ala Ala Ala Ile Leu Phe Leu Ser Gln Ser His Val Ala 
50 55 60 

Arg Ala Thr Pro Gly Ser Asp Gln Ala Val Leu Ala Leu Ser Pro Glu 
65 70 75 80 

Tyr Glu Gly Ile Trp Ala Asp Leu Gln Glu Leu Trp Phe Leu Gly Xaa 
85 90 95 

Gln Ala Phe Thr Gly Cys Val Pro Leu Leu 
100 105 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TANGACTCCT CTCTGCNGAG ACGCGACTGG CGGNTCCAGC AGGGANTACC TTTTTTATAA 60 

ACCCKGGGGG NCCACACACA CACACACACA CACACACACA CACACACACA CACACACACA 120 

CATTTTTGAT CCCTTGCTTC CNTCCCCCAG TGCGTTCTGT GATCGCCAAG TTCAAAGCTG 180 

TGCACATGTG GACACTCAAT AAATGTTCAT TGOJGACAAA AAAAAAAAAA AAAAAAAAAA 240 

AAAAA 245 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2384 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 






WO 98/17687 PCT/US97/19590 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CCAAAACATG CTCAAAAGTA GAACATTTTG TTTCAATATT AGGAAAGTGC TTTGAATCCC 60 

CTTGGACGAC AAAAGCGTTG TCTGAGACAG CATGCGAAGA CTCAGAGGAA AACAAGCAGA 120 

GAATAACAGG TGCCCAGACT CTACCAAAGC ATGTTTCTAC CAGCAGTGAT GAAGGGAGCC 180 

CCAGTGCCAG TACACCAATG ATCAATAAAA CTGGCTTTAA ATTTTCAGCT GAGAAGCCTG 240 

TGATTGAAGT TCCCAGCATG ACAATCCTGG ATAAAAAGGA TGGAGAGCAG GCCAAAGCCC 300 

TGTTTGAGAA AGTGAGGAAG TTCCGTGCCC ATGTGGAAGA TAGTGACTTG ATCTATAAAC 360 

TCTATGTGGT CCAAACAGTT ATCAAAACAG CCAAGTTCAT TTTTATTCTC TGCTATACAG 420 

CGAACTTTGT CAACGCAATC AGCTTTGAAC ACGTCTGCAA GCCCAAAGTT GAGCATCTGA 480 

TTGGTTATGA GGTATTTGAG TGCACCCACA ATATGGCTTA CATGTTGAAA AAGCTTCTCA 540 

TCAGTTACAT ATCCATTATT TGTGTTTATG GCTTTATCTG CCTCTACACT CTCTTCTGGT 600 

TATTCAGGAT ACCTTTGAAG GAATATTCTT TCGAAAAAGT CAGAGAAGAG AGCAGTTTTA 660 

GTGACATTCC AGATGTCAAA AACGATTTTG CGTTCCTTCT TCACATGGTA GACCAGTATG 720 

ACCAGCTATA TTCCAAGCGT TTTGGTGTGT TCTTGTCAGA AGTTAGTGAA AATAAACTTA 780 

GGGAAATTAG TTTGAACCAT GAGTGGACAT TTGAAAAACT CAGGCAGCAC ATTTCACGCA 840 

ACGCCCAGGA CAAGCAGGAG TTGCATCTGT TCATGCTGTC GGGGGTGCCC GATGCTGTCT 900 

TTGACCTCAC AGACCTGGAT GTGCTAAAGC TTGAACTAAT TCCAGAAGCT AAAATTCCTG 960 

CTAAGATTTC TCAAATGACT AACCTCCAAG AGCTCCACCT CTGCCACTGC CCTGCAAAAG 1020 

TTGAACAGAC TGCTTTTAGC TTTCTTCGCG ATCACTTGAG ATGCCTTCAC GTGAAGTTCA 1080 

CTGATGTGGC TGAAATTCCT GCCTGGGTGT ATTTGCTCAA AAACCTTCGA GAGTTGTACT 1140 

TAATAGGCAA TTTGAACTCT GAAAACAATA AGATGATAGG ACTTGAATCT CTCCGAGAGT 1200 

TGCGGCACCT TAAGATTCTC CACGTGAAGA GCAATTTGAC CAAAGTTCCC TCCAACATTA 1260 

CAGATGTGGC TCCACATCTT ACAAAGTTAG TCATTCATAA TGACGGCACT AAACTCTTGG 1320 

TACTGAACAG CCTTAAGAAA ATGATGAATG TCGCTGAGCT GGAACTCCAG AACTGTGAGC 1380 

TAGAGAGAAT CCCACATGCT ATTTTCAGCC TCTCTAATTT ACAGGAACTG GATTTAAAGT 1440 

CCAATAACAT TCGCACAATT GAGGAAATCA TCAGTTTCCA GCATTTAAAA CGACTCACTT 1500 

GTTTAAAATT ATGGCATAAC AAAATTGTTA CTATTCCTCC CTCTATTACC CATGTCAAAA 1560 

ACTTGGAGTC ACTTTATTTC TCTAACAACA AGCTCGAATC CTTACCAGTG GCAGTATTTA 1620 
GTTTACAGAA ACTCAGATGC TTAGATGTGA GCTACAACAA CATTTCAATG ATTCCAATAG 1680 

AAATAGGATT GCTTCAGAAC CTGCAGCATT TGCATATCAC TGGGAACAAA GTGGACATTC 1740 

TGCCAAAACA ATTGTTTAAA TGCATAAAGT TGAGGACTTT GAATCTGGGA CAGAACTGCA 1800 

TCACCTCACT CCCAGAGAAA GTTGGTCAGC TCTCCCAGCT CACTCAGCTG GAGCTGAAGG 1860 

GGAACTGCTT GGACCGCCTG CCAGCCCAGC TGGGCCAGTG TCGGATGCTC AAGAAAAGCG 1920 

GGCTTGTTGT GGAAGATCAC CTTTTTGATA CCCTGCCACT CGAAGTCAAA GAGGCATTGA 1980 

ATCAAGACAT AAATATTCCC TTTGCAAATG GGATTTAMAC TAAGATAATA TATGCACAGT 2040 

GATGTGCAGG AACAACTTCC TAGATTGCAA GTGCTCACGT ACAAGTTATT ACAAGATAAT 2100 

GCATTTTAGG AGTAGATACA TCTTTTAAAA TAAAACAGAG AGGATGCATA GAAGGCTGAT 2160 

AGAAGACATA ACTGAATGTT CAATGTTTGT AGGGTTTTAA GTCATTCATT TCCAAATCAT 2220 

TTTTTTTTTT CTTTTGGGGA AAGGGAAGGA AAAATTATAA TCACTAATCT TGGTTCTTTT 2280 

TAAATTGTTT GTAACTTGGA TGCTGCCGCT ACTGAATGTT TACAAATTGC TTGCCTGCTA 2340 

AAGTAAATGA TTAAATTGAC ATTTTCTTAC TATAAAAAAA AAAA 2384 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 614 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Ile Asn Lys Thr Gly Phe Lys Phe Ser Ala Glu Lys Pro Val Ile 
1 5 10 15 

Glu Val Pro Ser Met Thr Ile Leu Asp Lys Lys Asp Gly Glu Gln Ala 
20 25 30 

Lys Ala Leu Phe Glu Lys Val Arg Lys Phe Arg Ala His Val Glu Asp 
35 40 45 

Ser Asp Leu Ile Tyr Lys Leu Tyr Val Val Gln Thr Val Ile Lys Thr 
50 55 60 

Ala Lys Phe Ile Phe Ile Leu Cys Tyr Thr Ala Asn Phe Val Asn Ala 
65 70 75 80 

Ile Ser Phe Glu His Val Cys Lys Pro Lys Val Glu His Leu Ile Gly 
85 90 95 

Tyr Glu Val Phe Glu Cys Thr His Asn Met Ala Tyr Met Leu Lys Lys 
100 105 110 

Leu Leu Ile Ser Tyr Ile Ser Ile Ile Cys Val Tyr Gly Phe Ile Cys 
115 120 125 

Leu Tyr Thr Leu Phe Trp Leu Phe Arg Ile Pro Leu Lys Glu Tyr Ser 
130 135 140 

Phe Glu Lys Val Arg Glu Glu Ser Ser Phe Ser Asp Ile Pro Asp Val 
145 150 155 160 

Lys Asn Asp Phe Ala Phe Leu Leu His Met Val Asp Gln Tyr Asp Gln 
165 170 175 

Leu Tyr Ser Lys Arg Phe Gly Val Phe Leu Ser Glu Val Ser Glu Asn 
180 185 190 

Lys Leu Arg Glu Ile Ser Leu Asn His Glu Trp Thr Phe Glu Lys Leu 
195 200 205 

Arg Gln His Ile Ser Arg Asn Ala Gln Asp Lys Gln Glu Leu His Leu 
210 215 220 

Phe Met Leu Ser Gly Val Pro Asp Ala Val Phe Asp Leu Thr Asp Leu 
225 230 235 240 

Asp Val Leu Lys Leu Glu Leu Ile Pro Glu Ala Lys Ile Pro Ala Lys 
245 250 255 

Ile Ser Gln Met Thr Asn Leu Gln Glu Leu His Leu Cys His Cys Pro 
260 265 270 

Ala Lys Val Glu Gln Thr Ala Phe Ser Phe Leu Arg Asp His Leu Arg 
275 280 285 

Cys Leu His Val Lys Phe Thr Asp Val Ala Glu Ile Pro Ala Trp Val 
290 295 300 

Tyr Leu Leu Lys Asn Leu Arg Glu Leu Tyr Leu Ile Gly Asn Leu Asn 
305 310 315 320 

Ser Glu Asn Asn Lys Met Ile Gly Leu Glu Ser Leu Arg Glu Leu Arg 
325 330 335 

His Leu Lys Ile Leu His Val Lys Ser Asn Leu Thr Lys Val Pro Ser 
340 345 350 

Asn Ile Thr Asp Val Ala Pro His Leu Thr Lys Leu Val Ile His Asn 
355 360 365 

Asp Gly Thr Lys Leu Leu Val Leu Asn Ser Leu Lys Lys Met Met Asn 
370 375 380 






Val Ala Glu Leu Glu Leu Gln Asn Cys Glu Leu Glu Arg Ile Pro His 
385 390 395 400 

Ala Ile Phe Ser Leu Ser Asn Leu Gln Glu Leu Asp Leu Lys Ser Asn 
405 410 415 

Asn Ile Arg Thr Ile Glu Glu Ile Ile Ser Phe Gln His Leu Lys Arg 
420 425 430 

Leu Thr Cys Leu Lys Leu Trp His Asn Lys Ile Val Thr Ile Pro Pro 
435 440 445 

Ser Ile Thr His Val Lys Asn Leu Glu Ser Leu Tyr Phe Ser Asn Asn 
450 455 460 

Lys Leu Glu Ser Leu Pro Val Ala Val Phe Ser Leu Gln Lys Leu Arg 
465 470 475 480 

Cys Leu Asp Val Ser Tyr Asn Asn Ile Ser Met Ile Pro Ile Glu Ile 
485 490 495 

Gly Leu Leu Gln Asn Leu Gln His Leu His Ile Thr Gly Asn Lys Val 
500 505 510 

Asp Ile Leu Pro Lys Gln Leu Phe Lys Cys Ile Lys Leu Arg Thr Leu 
515 520 525 

Asn Leu Gly Gln Asn Cys Ile Thr Ser Leu Pro Glu Lys Val Gly Gln 
530 535 540 

Leu Ser Gln Leu Thr Gln Leu Glu Leu Lys Gly Asn Cys Leu Asp Arg 
545 550 555 560 

Leu Pro Ala Gln Leu Gly Gln Cys Arg Met Leu Lys Lys Ser Gly Leu 
565 570 575 

Val Val Glu Asp His Leu Phe Asp Thr Leu Pro Leu Glu Val Lys Glu 
580 585 590 

Ala Leu Asn Gln Asp Ile Asn Ile Pro Phe Ala Asn Gly Ile Xaa Thr 
595 600 605 

Lys Ile Ile Tyr Ala Gln 
610 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2386 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CCAAAACATC AAACCCTTTT CTTGTAGCAG CACAGGATTC TGAGACAGAT TATGTCACAA 60 

CAGATAATTT AACAAAGGTG ACTCAGGAAG TCGTGGCAAA CATGCCTGAA GGCCTGACTC 120 

CAGATTTAGT ACAGGAAGCA TGTGAAAGTG AATTGAATGA AGTTACTGGT ACAAAGATTG 180 

CTTATGAAAC AAAAATGGAC TTGGTTCAAA CATCAGAAGT TATGCAAGAG TCACTCTATC 240 

CTGCAGCACA GCTTTGCCCA TCATTTGAAG AGTCAGAAGC TACTCCTTCA CCAGTTTTGC 300 

CTGACATTGT TATGGAAGCA CCATTGAATT CTGCAGTTCC TAGTGCTGGT GCTTCCGTGA 360 

TACAGCCCAG CTCATCACCA TTAGAAGCTT CTTCAGTTAA TTATCAAAGC ATAAAACATG 420 

AGCCTGAAAA CCCCCCACCA TATGAAGAGG CCATGAGTGT ATCACTAAAA AAAGTATCAG 480 

GAATAAAGGA AGAAATTAAA GAGCCTGAAA ATATTAATGC AGCTCTTCAA GAAACAGAAG 540 

CTCCTTATAT ATCTATTGCA TGTGATTTAA TTAAAGAAAC AAAGCTTTCT GCTGAACCAG 600 

CTCCGGATTT CTCTGATTAT TCAGAAATGG CAAAAGTTGA ACAGCCAGTG CCTGATCATT 660 

CTGAGCTAGT TGAAGATTCC TCACCTGATT CTGAACCAGT TGACTTATTT AGTGATGATT 720 

CAATACCTGA CGTTCCACAA AAACAAGATG AAACTGTGAT GCTTGTGAAA GAAAGTCTCA 780 

CTGAGACTTC ATTTGAGTCA ATGATAGAAT ATGAAAATAA GGAAAAACTC AGTGCTTTGC 840 

CACCTGAGGG AGGAAAGCCA TATTTGGAAT CTTTTAAGCT CAGTTTAGAT AACACAAAAG 900 

ATACCCTGTT ACCTGATGAA GTTTCAACAT TGAGCAAAAA GGAGAAAATT CCTTTGCAGA 960 

TGGAGGAGCT CAGTACTGCA GTTTATTCAA ATGATGACTT ATTTATTTCT AAGGAAGCAC 1020 

AGATAAGAGA AACTGAAACG TTTTCAGATT CATCTCCAAT TGAAATTATA GATGAGTTCC 1080 

CTACATTGAT CAGTTCTAAA ACTGATTCAT TTTCTAAATT AGCCAGGGAA TATACTGACC 1140 

TAGAAGTATC CCACAAAAGT GAAATTGCTA ATGCCCCGGA TGGAGCTGGG TCATTGCCTT 1200 

GCACAGAATT GCCCCATGAC CTTTCTTTGA AGAACATACA ACCCAAAGTT GAAGAGAAAA 1260 

TCAGTTTCTC AGATGACTTT TCTAAAAATG GGTCTGCTAC ATCAAAGGTG CTCTTATTGC 1320 

CTCCAGATGT TTCTGCTTTG GCCACTCAAG CAGAGATAGA GAGCATAGTT AAACCCAAAG 1380 

TTCTTGTGAA AGAAGCTGAG AAAAAACTTC CTTCCGATAC AGAAAAAGAG GACAGATCAC 1440 

CATCTGCTAT ATTTTCAGCA GAGCTGAGTA AAACTTCAGT TGTTGACCTC CTGTACTGGA 1500 

GAGACATTAA GAAGACTGGA GTGGTGTTTG GTGCCAGCCT ATTCCTGCTG CTTTCATTGA 1560 

CAGTATTCAG CATTGTGAGC GTAACAGCCT ACATTGCCTT GGCCCTGCTC TCTGTGACCA 1620 


TCAGCTTTAG GATATACAAG GGTGTGATCC AAGCTATCCA GAAATCAGAT GAAGGCCACC 1680 

CATTCAGGGA AGTTGCTATA TCTGAGGAGT TGGTTCAGAA GTACAGTAAT TCTGCTCTTG 1740 

GTCATGTGAA CTGCACGATA AAGGAACTCA GGCGCCTCTT CTTAGTTGAT GATTTAGTTG 1800 

ATTCTCTGAA GTTTGCAGTG TTGATGTGGG TATTTACCTA TGTTGGTGCC TTGTTTAATG 1860 

GTCTGACACT ACTGATTTTG GCTCTCATTT CACTCTTCAG TGTTCCTGTT ATTTATGAAC 1920 

GGCATCAGGC ACAGATAGAT CATTATCTAG GACTTGCAAA TAAGAATGTT AAAGATGCTA 1980 

TGGCTAAAAT CCAAGCAAAA ATCCCTGGAT TGAAGCGCAA AGCTGAATGA AAACGCCCAA 2040 

AATAATTAGT AGGAGTTCAT CTTTAAAGGG GATATTCATT TGATTATACG GGGGAGGGTC 2100 

AGGGAAGAAC GAACCTTGAC GTTGCAGTGC AGTTTCACAG ATCGTTGTTA GATCTTTATT 2160 

TTTAGCCATG CACTGTTGTG AGGAAAAATT ACCTGTCTTG ACTGCCATGT GTTCATCATC 2220 

TTAAGTATTG TAAGCTGCTA TGTATGGATT TAAACCGTAA TCATATCTTT TTCCTATCTG 2280 

AGGCACTGGT GGAATAAAAA ACCTGTATAT TTTACTTTGT TGCAGATAGT CTTGCCGCAT 2340 

CTTGGCAAGT TGCAGAGATG GTGGAGCTAG AAAAAAAAAA AAAAAA 2386 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Pro Glu Gly Leu Thr Pro Asp Leu Val Gln Glu Ala Cys Glu Ser 
1 5 10 15 

Glu Leu Asn Glu Val Thr Gly Thr Lys Ile Ala Tyr Glu Thr Lys Met 
20 25 30 

Asp Leu Val Gln Thr Ser Glu Val Met Gln Glu Ser Leu Tyr Pro Ala 
35 40 45 

Ala Gln Leu Cys Pro Ser Phe Glu Glu Ser Glu Ala Thr Pro Ser Pro 
50 55 60 

Val Leu Pro Asp Ile Val Met Glu Ala Pro Leu Asn Ser Ala Val Pro 
65 70 75 80 

Ser Ala Gly Ala Ser Val Ile Gln Pro Ser Ser Ser Pro Leu Glu Ala 
85 90 95 

Ser Ser Val Asn Tyr Glu Ser Ile Lys His Glu Pro Glu Asn Pro Pro 
100 105 110 

Pro Tyr Glu Glu Ala Met Ser Val Ser Leu Lys Lys Val Ser Gly Ile 
115 120 125 

Lys Glu Glu Ile Lys Glu Pro Glu Asn Ile Asn Ala Ala Leu Gln Glu 
130 135 140 

Thr Glu Ala Pro Tyr Ile Ser Ile Ala Cys Asp Leu Ile Lys Glu Thr 
145 150 155 160 

Lys Leu Ser Ala Glu Pro Ala Pro Asp Phe Ser Asp Tyr Ser Glu Met 
165 170 175 

Ala Lys Val Glu Gln Pro Val Pro Asp His Ser Glu Leu Val Glu Asp 
180 185 190 

Ser Ser Pro Asp Ser Glu Pro Val Asp Leu Phe Ser Asp Asp Ser Ile 
195 200 205 

Pro Asp Val Pro Gln Lys Gln Asp Glu Thr Val Met Leu Val Lys Glu 
210 215 220 

Ser Leu Thr Glu Thr Ser Phe Glu Ser Met Ile Glu Tyr Glu Asn Lys 
225 230 235 240 

Glu Lys Leu Ser Ala Leu Pro Pro Glu Gly Gly Lys Pro Tyr Leu Glu 
245 250 255 

Ser Phe Lys Leu Ser Leu Asp Asn Thr Lys Asp Thr Leu Leu Pro Asp 
260 265 270 

Glu Val Ser Thr Leu Ser Lys Lys Glu Lys Ile Pro Leu Gln Met Glu 
275 280 285 

Glu Leu Ser Thr Ala Val Tyr Ser Asn Asp Asp Leu Phe Ile Ser Lys 
290 295 300 

Glu Ala Gln Ile Arg Glu Thr Glu Thr Phe Ser Asp Ser Ser Pro Ile 
305 310 315 320 

Glu Ile Ile Asp Glu Phe Pro Thr Leu Ile Ser Ser Lys Thr Asp Ser 
325 330 335 

Phe Ser Lys Leu Ala Arg Glu Tyr Thr Asp Leu Glu Val Ser His Lys 
340 345 350 

Ser Glu Ile Ala Asn Ala Pro Asp Gly Ala Gly Ser Leu Pro Cys Thr 
355 360 365 

Glu Leu Pro His Asp Leu Ser Leu Lys Asn Ile Gln Pro Lys Val Glu 
370 375 380 






Glu Lys Ile Ser Phe Ser Asp Asp Phe Ser Lys Asn Gly Ser Ala Thr 
385 390 395 400 

Ser Lys Val Leu Leu Leu Pro Pro Asp Val Ser Ala Leu Ala Thr Gln 
405 410 415 

Ala Glu Ile Glu Ser Ile Val Lys Pro Lys Val Leu Val Lys Glu Ala 
420 425 430 

Glu Lys Lys Leu Pro Ser Asp Thr Glu Lys Glu Asp Arg Ser Pro Ser 
435 440 445 

Ala Ile Phe Ser Ala Glu Leu Ser Lys Thr Ser Val Val Asp Leu Leu 
450 455 460 

Tyr Trp Arg Asp Ile Lys Lys Thr Gly Val Val Phe Gly Ala Ser Leu 
465 470 475 480 

Phe Leu Leu Leu Ser Leu Thr Val Phe Ser Ile Val Ser Val Thr Ala 
485 490 495 

Tyr Ile Ala Leu Ala Leu Leu Ser Val Thr Ile Ser Phe Arg Ile Tyr 
500 505 510 

Lys Gly Val Ile Gln Ala Ile Gln Lys Ser Asp Glu Gly His Pro Phe 
515 520 525 

Arg Glu Val Ala Ile Ser Glu Glu Leu Val Gln Lys Tyr Ser Asn Ser 
530 535 540 

Ala Leu Gly His Val Asn Cys Thr Ile Lys Glu Leu Arg Arg Leu Phe 
545 550 555 560 

Leu Val Asp Asp Leu Val Asp Ser Leu Lys Phe Ala Val Leu Met Trp 
565 570 575 

Val Phe Thr Tyr Val Gly Ala Leu Phe Asn Gly Leu Thr Leu Leu Ile 
580 585 590 

Leu Ala Leu Ile Ser Leu Phe Ser Val Pro Val Ile Tyr Glu Arg His 
595 600 605 

Gln Ala Gln Ile Asp His Tyr Leu Gly Leu Ala Asn Lys Asn Val Lys 
610 615 620 

Asp Ala Met Ala Lys Ile Gln Ala Lys Ile Pro Gly Leu Lys Arg Lys 
625 630 635 640 

Ala Glu 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGCGGCTGCG GANCCGGCGG TCCTTGCGCT CCCCAACANC GGCGCCGGGG GCGCGGGGGC 60 

GCCGTCGGGC ACAGTCCCGG TGCTCTTCTG TTTCTCAGTC TTCGCGCGAC CCTCGTCGGT 120 

GCCACACGGG GCGGGCTACA AGCTGCTCAT CCAGAAGTTC CTCAGCCTGT ACGGCGACCA 180 

GATCNACATG CACCGCAAAT TCGTGGTGCA GCTGTTCGCC GAGGAGTGGG GCCAGTACGT 240 

GGACTTGCCC AAGGGCTTCN CGGTGAGCGA GCGCTGCAAG GTGCGCCTCG TGCCGCTGCA 300 

TATCCAGCTC ACTACCCTGG GAAATCTTAC ACCTTCAAGC ACTG 344 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ACATGATTCA GAGTCCTGTG GGTAAATTCA TATGCAATAA TCTTATTCCA ATCAATCTGT 60 

AAAGTAAAAG CANTACATCC ACATTAACAT TATAACATCT TACAGTAATA TAAAAGCCAA 120 

ATCATTGTTG GTACGTCATT TTCTTTAAAG TGAACAATTT AAGAAAACTT CACAAGAGTC 180 

TGCACTTTGG AAAGATACGA TCAGAGTACA CAGTAGAGAC AAAACAGGCA TCTTCATTGT 240 

AATTTTTTTT AATAAATAAA AGCACATTAA CAAAAAAGGA AGGTAAGCAG CACCGGAAGC 300 

CTTTGACGTT TGTAACTAAA TGCTGGTACT CAATTGAATC GAGCTGGTTA AGTTTCACTA 360 

GGAGGCGCNA AAAAGGAGCC GTTTTTGACT TAACATTTTA ATTCTAGTAG AGATAAGAAG 420 

AGCTTGTGTG GGCTTACAGT CCTTCACCTG ACTGTCCTTC ACCAGTGAGT AGCATACCAG 480 

TTCTTCAAAT GTCCTATACT TTGGAAAGCA GACCCGACTC TGGAGCACTC GCCTTAATTA 540 

GATTCTGAAT TTCCTTGAAT TTTGGATGGT CCTTATCAGC TACCAGCTGA AGCAGAACAG 600 
CCTCACTCGT GGTCACTATG ATCCCGGTTC G 631 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Val Leu Ile Ser Tyr Gln Leu Lys Gln Asn Ser Leu Thr Arg Gly 
1 5 10 15 

His Tyr Asp Pro Gly Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 60 
AAAAAAAAAA 70 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AGCACGCGGT CCTGCCCGTG GACGGGGCAA CGCTCGCAGA TCTGATGCGC CAGCGGGGCA 60 
TCAACATGCG CTACCTGGGC AAGGTGCTGG AGCTCCTCCT GCGGARCCCG GCCCGCCACC 120 
AGCTGGACCA CGTCTTTAAA ATCGGCATTG GAGAACTCAT CACCCGCTCG SCCAAGCACA 180 
TCTTCAAGAC GTACTTACAG GGAGTCGAGC TCTCCGGCCT CTCAGCCGCC ATCAGCCACT 240 
TCCTGAACTG CTTCCTGAGC TCCTACCCAA ACCCCGTCGC CCACCTCCCC GCCGACGAGC 300 
TGGTGTCCAA GAAGCGGAAT AAGAGGAGGA AAAACCGGCC CCCGGGGGCT GCAGATAACA 360 
CAGCCTGGGC TGTCATGACC CCCCAGGAGC TCTGGAAGAA CATCTCCCAG GAGGCCAAGA 420 
ACTACTTT 428 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Arg Gln Arg Gly Ile Asn Met Arg Tyr Leu Gly Lys Val Leu Glu 
1 5 10 15 

Leu Val Leu Arg Xaa Pro Ala Arg His Gln Leu Asp His Val Phe Lys 
20 25 30 

Ile Gly Ile Gly Glu Leu Ile Thr Arg Ser Xaa Lys His Ile Phe Lys 
35 40 45 

Thr Tyr Leu Gln Gly Val Glu Leu Ser Gly Leu Ser Ala Ala Ile Ser 
50 55 60 

His Phe Leu Asn Cys Phe Leu Ser Ser Tyr Pro Asn Pro Val Ala His 
65 70 75 80 

Leu Pro Ala Asp Glu Leu Val Ser Lys Lys Arg Asn Lys Arg Arg Lys 
85 90 95 

Asn Arg Pro Pro Gly Ala Ala Asp Asn Thr Ala Trp Ala Val Met Thr 
100 105 110 

Pro Gln Glu Leu Trp Lys Asn Ile Cys Gln Glu Ala Lys Asn Tyr Phe 
115 120 125 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TGGTGGGGGA GGATGTGCCC ACCCTGAGAC CCGGAGGAGA CGGGCNTTTG CCTGGGTTTG 60 

CGGAGAGCCG CTTATGGGTG TGGTCCGTCC AGACACCTTG TTTCAAGGGG GATGGGCGTG 120 

AGCGGGCAAG CAGAGCANCC CCACCGNTGA GCAAGAACTT TTTTTTGTTT TTAAACCATC 180 

ACGTCCTCAT TTCACATTGG AATAAAGTGA GTTTTTGAAA AAAAAAAAAA AAAAAAAAAA 240 

AAAAA 245 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

CAGTGAGCCC TTTGAAAAAT AAACATCCAG ATGAAGATGC TGTGGAAGCT GAGGGGCATG 60 

AGGTAAAAAG ACTCAGGTTT GACAAAGAAG GTGAAGTCAG AGAAACAGCC AGTCAAACGA 120 

CTTCCAGCGA T^TTTCTTCA GTTATGGTAG GAGAAACAGA AGCATCATCT TCATCTCAGG 180 

ATAAAGACAA AGATAGCCGT TGTWCCCGGC AGCACTGTWC AGAAGAGGAT GAAGAAGAGG 240 

ATGAAGAGGA AGAAGAAGAG TCTTTTATGA CATCAAGAGA AATGATCCCA GAAAGAAAAA 300 

ATCAAGAAAA AGAATCTGAT GATGCCTTAA CTGTGAATGA AGAGACTTCT GAGGAAAATA 360 

ATCAAATGGA GGAATCTGAT GTGTCTCAAG CTGAGAAAGA TTTGCTACAT TCTGAAGGTA 420 

GTGAAAACGA AGGCCCTGTA AGTAGTAGTT CTTCTGACTG CCGTGAAACA GAAGAATTAG 480 
TAGGATCCAA TTCCAGTAAA ACTGGAGAGA TTCTTTCAGA ATCATCCATG GAAAATGATG 540 
ACGAAGCCAC AGAAGTCACC GATGAA 566 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Val Gly Glu Thr Glu Ala Ser Ser Ser Ser Gln Asp Lys Asp Lys 
1 5 10 15 

Asp Ser Arg Cys Xaa Arg Gln His Cys Xaa Glu Glu Asp Glu Glu Glu 
20 25 30 

Asp Glu Glu Glu Glu Glu Glu Ser Phe Met Thr Ser Arg Glu Met Ile 
35 40 45 

Pro Glu Arg Lys Asn Gln Glu Lys Glu Ser Asp Asp Ala Leu Thr Val 
50 55 60 

Asn Glu Glu Thr Ser Glu Glu Asn Asn Gln Met Glu Glu Ser Asp Val 
65 70 75 80 

Ser Gln Ala Glu Lys Asp Leu Leu His Ser Glu Gly Ser Glu Asn Glu 
85 90 95 

Gly Pro Val Ser Ser Ser Ser Ser Asp Cys Arg Glu Thr Glu Glu Leu 
100 105 110 

Val Gly Ser Asn Ser Ser Lys Thr Gly Glu Ile Leu Ser Glu Ser Ser 
115 120 125 

Met Glu Asn Asp Asp Glu Ala Thr Glu Val Thr Asp Glu 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

TCATCATGGC TATAAATACC AAAACGATTT GGATCCATTT ATGTTTGTAG GATAATATAC 60 

TACTGACTGA CTTGACTGTC AGGTTCACAA CAGCTAGATG ATATATTTAT GACTATGTCT 120 

AATAGTTGAA ATAAAATCTG AATATTGATT TACTATACCC AAGAGGGGAG AAAAATTAAC 180 

CATTGTAAAT TTTTAAAAAT TTTTTCAAAA ATGTTAAAAT GAGGCAAATT TAAGTTTACA 240 

AATTTTGAAA TTTTCTTTTG AATATTTATG AAATTGTCAG TAAACTTACC TAAGATCCTG 300 

TGACCTTTTG ATATTTTTTA TTTTAATTGT AGTGCCATGG ACCATTTGTA AACAAATTGA 360 

TTTACTTTTG TTGGTTGTAA GTTGAAGATT TAGCATTATG ACTTTGAGGT CTGTGGTTTT 420 

ATTTGTAAAC TTGCAATTGC TATATTTGCA AGGGCAAATG TATTTCTTTA TTAAATAAAG 480 

TACAATAATG GTGAATGTAC CAAAATGACA TCACTTAAAA AAAAAAAAAA A 531 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1163 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

KCTGGAACCA CGCGGARGAA GGAAGAGACG CAGGCAGGCT GCGGTTACCC AAGCGGSCAC 60 

CCGGGCCTCA GGGACCCTTC CCCGAGAGAC GGCACCATGA CCCAGGGAAA GCTCTCCGTG 120 

GCTAACAAGC CCCTGGGACC GAGGGGCAGC AGCAKGTGCA TGGCGAGAAG AAGGAGCTCC 180 

AGCAGTGCCC TCAGCCCCAC CCTCCTATGA GGAACCACCT CTGGGGAGGG GATGAAGGCA 240 

GGGGCCTTCC CCCCAGCCCC CACAGCGGTG CCTCTCCACC CTAGCTGGGC CTATGTGGAC 300 

CCCAGCAGCA GCTCCAGCTA TGACAACGGT TTCCCCACCG GAGACCATGA GCTCTTCACC 360 

ACTTTCAGCT GGGATGACCA GAAAGTTCGT CGAGTCTTTG TCAGAAAGGT CTACACCATC 420 

CTCCTGATTC AGCTGCTGGT GACCTTGGCT GTCGTGGCTC TCTTTACTTT CTGTGACCCT 480 

GTCAAGGACT ATGTCCAGGC CAACCCAGGC TGGTACTGGG CATCCTATGC TGTGTTCTTT 540 

GCAACCTACC TGACCCTGGC TTGCTGTTCT GGACCCAGGA GGCATTTCCC CTGGAACCTG 600 
ATTCTCCTGA CCGTCTTTAC CCTGTCCATG GCCTACCTCA CTGGGATGCT GTCCAGCTAC 660 

TACAACACCA CCTCCGTGCT GCTGTGCCTG GGCATCACGG CCCTTGTCTG CCTCTCAGTC 720 

ACCGTCTTCA GCTTCCAGAC CAAGTTCGAC TTCACCTCCT GCCAGGGCGT GCTCTTCGTG 780 

CTTCTCATGA CTCTTTTCTT CAGCGGACTC ATCCTGGCCA TCCTCCTACC CTTCCAATAT 840 

GTGCCCTGGC TCCATGCAGT TTATGCAGCA CTGGGAGCGG GTGTATTTAC ATTCTTCCTG 900 

GCACTTGACA CCCAGTTGCT GATGGGTAAC CGACGCCACT CGCTGAGCCC TGAGGAGTAT 960 

ATTTTTGGAG CCCTCAACAT TTACCTAGAC ATCATCTATA TCTTCACCTT CTTCCTGCAG 1020 

CTTTTTGGCA CTAACCGAGA ATGAGGAGCC CTCCCTCCCC CACCGTCCTC CAGAGAATCC 1080 

GCCCCTCCTG GTTCCCTGTC CCTCCCCTGC GCTCCTGCGA GACCAGATAT AAAACTAGCT 1140 
GCCAACCCAA AAAAAAAAAA AAA 1163 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Lys Ala Gly Ala Phe Pro Pro Ala Pro Thr Ala Val Pro Leu His 
1 5 10 15 

Pro Ser Trp Ala Tyr Val Asp Pro Ser Ser Ser Ser Ser Tyr Asp Asn 
20 25 30 

Gly Phe Pro Thr Gly Asp His Glu Leu Phe Thr Thr Phe Ser Trp Asp 
35 40 45 

Asp Gln Lys Val Arg Arg Val Phe Val Arg Lys Val Tyr Thr Ile Leu 
50 55 60 

Leu Ile Gln Leu Leu Val Thr Leu Ala Val Val Ala Leu Phe Thr Phe 
65 70 75 80 

Cys Asp Pro Val Lys Asp Tyr Val Gln Ala Asn Pro Gly Trp Tyr Trp 
85 90 95 

Ala Ser Tyr Ala Val Phe Phe Ala Thr Tyr Leu Thr Leu Ala Cys Cys 
100 105 110 

Ser Gly Pro Arg Arg His Phe Pro Trp Asn Leu Ile Leu Leu Thr Val 
115 120 125 

Phe Thr Leu Ser Met Ala Tyr Leu Thr Gly Met Leu Ser Ser Tyr Tyr 
130 135 140 

Asn Thr Thr Ser Val Leu Leu Cys Leu Gly Ile Thr Ala Leu Val Cys 
145 150 155 160 

Leu Ser Val Thr Val Phe Ser Phe Gln Thr Lys Phe Asp Phe Thr Ser 
165 170 175 

Cys Gln Gly Val Leu Phe Val Leu Leu Met Thr Leu Phe Phe Ser Gly 
180 185 190 

Leu Ile Leu Ala Ile Leu Leu Pro Phe Gln Tyr Val Pro Trp Leu His 
195 200 205 

Ala Val Tyr Ala Ala Leu Gly Ala Gly Val Phe Thr Leu Phe Leu Ala 
210 215 220 

Leu Asp Thr Gln Leu Leu Met Gly Asn Arg Arg His Ser Leu Ser Pro 
225 230 235 240 

Glu Glu Tyr Ile Phe Gly Ala Leu Asn Ile Tyr Leu Asp Ile Ile Tyr 
245 250 255 

Ile Phe Thr Phe Phe Leu Gln Leu Phe Gly Thr Asn Arg Glu 
260 265 270 

(2) INFORMATION FOR SEQ ID NO:22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CGCACCCCCT CCGGCCGCGG GCGCAKCGGG GGCGCTGGTG GAKCTGMGAA GGGCCASGTC 60 

CGGCGGGCGG GGCGGCGGCT GGCACTGGCT CCGGACTCTG CCCGGCCAGG GCGGCGGMTC 120 

CANCCGGGAG GGCGACGTGG AGCGGCCACK TGGAKCGGCC CGGGGGARGC TGGCGGCGGG 180 

AKGCGAGGCG CGGGCGGCGC AKCAKCCAKG AGCGCCCACG GAGSTGGACC CCCAGAKCCG 240 

CGCGGCGCCG CAGCAGTTCC AGGAAGGATG TTACCTTTGA CGATGACAGT GTTAATCCTG 300 

CTGCTGCTCC CCACGGGTCA GGCTGCCCCA AAGGATGGAG TCACAAGGCC AGAATCTGAA 360 
GTGCAGCATC AGCTCCTGCC CAACCCCTTC CAGCCAGGCC AGGAGCAGGT CGGACTTCTG 420 

CAGAGCTACC TAAAGGGACT AGGAAGGACA GAAGTGCAAC TGGAGCATCT GAGCCGGGAG 480 

CAGGTTCTCC TCTACCTCTT TGCCCTCCAT GACTATGACC AGAGTGGACA GCTGGATGGC 540 

CTGGAGCTGC TGTCCATGTT GACAGCTGCT CTGGCCCCTG GAGCTGCCAA CTCTCCTACC 600 

ACCAACCCGG TGATCTTGAT AGTG 624 
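
SEQ ID NO:22 contains the symbols K, M, S, R and N in addition to A, C, G and T. Assuming these follow the standard IUPAC nucleotide ambiguity convention (an assumption; the listing itself does not define them), the following Python sketch shows which bases each symbol may stand for and how a short ambiguous stretch can be enumerated. The names IUPAC and expand are illustrative only.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (assumed to apply to this listing).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "N": "ACGT",
}

def expand(seq):
    """Return every unambiguous sequence consistent with seq."""
    choices = [IUPAC[base] for base in seq.upper()]
    return ["".join(p) for p in product(*choices)]

# Nucleotides 25-30 of SEQ ID NO:22 as printed: "AKCGGG"
print(expand("AKCGGG"))  # ['AGCGGG', 'ATCGGG']
```

Enumeration grows quickly with the number of ambiguous positions, so a sketch of this kind is only practical for short stretches.
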
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Leu Pro Leu Thr Met Thr Val Leu Ile Leu Leu Leu Leu Pro Thr
1 5 10 15

Gly Gln Ala Ala Pro Lys Asp Gly Val Thr Arg Pro Glu Ser Glu Val
20 25 30

Gln His Gln Leu Leu Pro Asn Pro Phe Gln Pro Gly Gln Glu Gln Leu
35 40 45

Gly Leu Leu Gln Ser Tyr Leu Lys Gly Leu Gly Arg Thr Glu Val Gln
50 55 60

Leu Glu His Leu Ser Arg Glu Gln Val Leu Leu Tyr Leu Phe Ala Leu
65 70 75 80

His Asp Tyr Asp Gln Ser Gly Gln Leu Asp Gly Leu Glu Leu Leu Ser
85 90 95

Met Leu Thr Ala Ala Leu Ala Pro Gly Ala Ala Asn Ser Pro Thr Thr
100 105 110

Asn Pro Val Ile Leu Ile Val
115 
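
The relationship between SEQ ID NO:22 and SEQ ID NO:23 can be illustrated by translating the start of the open reading frame, which appears to begin at the ATG at nucleotides 268-270 of SEQ ID NO:22. The Python sketch below assumes the standard genetic code; the function name translate and the way the codon table is built are illustrative and are not part of the original disclosure.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate dna in frame 1; stop at a stop codon, 'X' for ambiguous codons."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Nucleotides 268-300 of SEQ ID NO:22 as printed in the listing.
orf_start = "ATGTTACCTTTGACGATGACAGTGTTAATCCTG"
print(translate(orf_start))  # MLPLTMTVLIL
```

In one-letter code these eleven residues correspond to Met Leu Pro Leu Thr Met Thr Val Leu Ile Leu, the first eleven residues of SEQ ID NO:23.
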

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 60 
AAAAAAAAAA AAAAAAAAAA 80 
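
Each nucleotide row in this listing ends with a running count of the residues printed so far (60, 120, and so on, ending with the total length). As a hedged illustration, the Python sketch below parses rows of that form and checks the counts against the bases actually present; the helper name check_rows is hypothetical, and the input shown is SEQ ID NO:24 as printed above.

```python
def check_rows(rows):
    """Join listing rows and verify each trailing running count.

    Each row is groups of bases followed by a cumulative count,
    e.g. 'AAAAAAAAAA AAAAAAAAAA 20'. Returns the joined sequence.
    """
    sequence = ""
    for row in rows:
        *groups, count = row.split()
        sequence += "".join(groups)
        if len(sequence) != int(count):
            raise ValueError(f"expected {count} residues, found {len(sequence)}")
    return sequence

seq24 = check_rows([
    "AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 60",
    "AAAAAAAAAA AAAAAAAAAA 80",
])
print(len(seq24))  # 80
```

A check of this kind flags rows whose printed count and base content disagree, which is one way transcription errors in a listing can be located.
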
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AGACAGGGAA TACTTTATTC AAAACCCATC ACAGAAATGG ACAGCTTGGG TCTGTAACAA 60 

AGCATTCATG TTTTAGAGCA TAGGTCAGTA ATTGTATATG AGAGCATACA CTGCTACATA 120 

CAAATTAACT GATCAGACCA CAACTTTTCA ATGTTTAAAA CAGAATAAGC TTCCCTGTAA 180 

AAGCAGCACC TTTGTGACGT TTTAACTTTA GTATTCCTCT CCTTCTTCCT CACCCTCTCC 240

TTCAACAGAA TCCACACCAA CCTCCTCATA ATCCTTCCTC GCAGCACATG AATCACAGGT 300 

ATTCCTACTG CAAGCGGGAG GCGGARGARC GGGAAGCGGC GGARCGCGAR GCGCGCGAGA 360 

AAGGGCACTT GGAACCCACC GAGCTGCTGA TGAACCGGGC TTACTTGCAG AGCATTACCC 420 

CTCAGGGGTA CTCTGACTCG GAGGAGAGGG AGAGTATGCC GAGGGATGGC GAGAGCGAGA 480 

AGGAGCACGA GAAAGAAGGC GAGGATGGCT ACGGGAAGCT GGGCAGACAG GATGGCGACG 540 

AGGAGTTCGA GGAGGAAGAG GAAGAAAGTG AAAATAAAAG TATGGATACG GATCCCGAAA 600 

CGATACGAGA TGAAAAAGAG ACTGGAGATC ACTCCATGGA CGATAGTTCG GAGGATGGGA 660 

AAATGGAAAC CAAATCAGAC CACGAGGAAG ACAATATGGA AGATGGCATG TAATAAACTA 720 

CTGCATTTTA AGCTTCCTAT TTTTTTTTCC AGTAGTATTG TTACCTGCTT GAAAACACTG 780 

CTGTGTTAAG CTGTTCATGC ACGTGCCTGA CGCTTCCAGG AAGCTGTAGA GAGGGACAGA 840 

AGGGGCGGTT CAGCCAAGAC AGATGTWGAC GGAGTTGGAG CTGGGTATTG TTAAAAACTG 900
CATTATGCAA AAATTTTGTA CAGTGTTAAG GCCTAAAAAC TGTGTGGTTC AGAGACTAAT 960
TCCTGTGTTT AATAGCATTT ATACTTTAAG CACAACTAGA AAATTGTAAG AATTCCACTC 1020 
TACTTATGTA TCACTACAAA CTTTAAAAAA CTATGTCTAA TTOATATTAA TACATTTTAA 1080 
AAAGGTGCCC GCACTACCAT ACATCAGTAT TTTTATTATT ATTATTGTTA TTCCTTTTTA 1140 
ATTTAATGTG CTCGCACTAC AATGCATCAG TATTATGATT CCTCTGTACT TTCCTTTCGC 1200 
TATTCATCAA TTTCCCATTT TTTTTTTCAG CTTAAGTAAC CACACAATTT TAGGCCTCAA 1260 
TTTTTTTTTT TCTGTGAAGG AACTTGAAGT GATGCATGTG TGAATTTAAG ATACCGAAGT 1320
CTTAAAGTGA CCTGGACGTG AAGGAAAAAG TAAGATGAGA AATAAAGAAA GCCTTTCTAA 1380 
GGTGGTTTTA AAAGCCTTAT ATGCAAACCT TTTAATCTCT GTTTCTCCAA GTGCCATCCT 1440 
TGTACAGTGT TAAGAGGGTA ACATGGGTTA CCTTTCCACC AGCTTCAGTC TTAAGCTCAC 1500 
CCTGTTCTTT GAAGCACCCA TGTCAGTATT AGAAGAATAG GCAGCAGTTC CTTAGTTTAC 1560 
ATATGTTTGT GCAATTATTT TCTGTACTTT TTTOTTCATT AATTTTCTCA GTATOACACC 1620
AAACTGTTTT TGCAACAAAA AAATTTTTTT TGCATTCATT TAATTTTAGG TCAAATAACA 1680 

TTTTATTTAT GTGGCTCATT TTATATTTCC TAATTTTATT TATTTCATAC TGTAGTGTAC 1740

AGTATTATAG TTCTTCAATA TATAGATATA TTTOAGTAAA AAAGGAACAT GACGTTGATC 1800 

ATTTGGGCAA ATTTTACGTA AAGAGAAGAG CATTTATTGT GTTTTGGAAC ATTAATTOTG 1860 

AGATGGGATT TTTCAATTTT ATTATTTWAT TTTTGTTTTT TTCCAATOAC TGGAAATTCC 1920

AAATTTGGGA ACTTTTGATA CGATCTTGTG AAAACACTGT ATTTTCGACT GAAAATTCCA 1980 

CTTTCTTCAT CTTGTTTTTT AGCTAAAAAG AGGGACTGTT AAATACAATG TATGATACCA 2040 

TGACAAAAAT CTTTCCTGAA TTGTCCTTTG TAAAAGTATT ATTGAATTTT CAATTTGTAA 2100

TTTCTTTTGA AAATGACCAT GCTCGAATAA AAATCTAGCC AAACTAAAAA AAAAAAAAAA 2160 



A 2161

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 141 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Met Asn His Arg Tyr Ser Tyr Cys Lys Arg Glu Ala Glu Glu Arg Glu
1 5 10 15

Ala Ala Glu Arg Glu Ala Arg Glu Lys Gly His Leu Glu Pro Thr Glu
20 25 30

Leu Leu Met Asn Arg Ala Tyr Leu Gln Ser Ile Thr Pro Gln Gly Tyr
35 40 45

Ser Asp Ser Glu Glu Arg Glu Ser Met Pro Arg Asp Gly Glu Ser Glu
50 55 60

Lys Glu His Glu Lys Glu Gly Glu Asp Gly Tyr Gly Lys Leu Gly Arg
65 70 75 80

Gln Asp Gly Asp Glu Glu Phe Glu Glu Glu Glu Glu Glu Ser Glu Asn
85 90 95

Lys Ser Met Asp Thr Asp Pro Glu Thr Ile Arg Asp Glu Lys Glu Thr
100 105 110

Gly Asp His Ser Met Asp Asp Ser Ser Glu Asp Gly Lys Met Glu Thr
115 120 125

Lys Ser Asp His Glu Glu Asp Asn Met Glu Asp Gly Met
130 135 140

(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GCAGTTACTG GGARGGGGCT TGCTGTGGCC CTGTCAGGAA RARTAGAGCT CTGGTCCAGC 60 

TCCGCGCAGG GAGGGAGGCT GTCACCATGC CGGCCTGCTG CAGCTGCAGT GATGTTTTCC 120 

AGTATGAGAC GAACAAAGTC ACTCGGATCC AGAGCATGAA TTATGGCACC ATTAAGTGGT 180 

TCTTCCACGT GATCATCTTT TCCTACGTTT GCTTTGCTCT GGTGAGTGAC AAGCTGTACC 240

AGCGGAAAGA GCCTGTCATC AGTTCTGTGC ACACCAAGGT GAAGGGGATA GCAGARGTGA 300 

AAGAGGAGAT CGTGGAGAAT GGAGTGAAGA AGTTGGTGCA CAGTGTCTTT GACACCGCAG 360

ACTACACCTT CCCTTTGCAG GGGAACTCTT TCTTCGTGAT GACAAACTTT CTCAAAACAG 420

AAGGCCAAGA GCAGCGGTTG TGTCCCGAGT ATCCCACCCG CAGGACGSTS TGTTCYTCTG 480 

ACCGAGGTTG WAAAAAGGGA TGGATGGACC CGCAGAGCAA AGGAATTCAG ACCGGAAGGT 540 

GTGTAGTGCA TGAAGGGAAC CAGAAGACYT GTGAAGTCTY TGCCTGGWGC CCCATSGAGG 600

CAGTGGAAGA GGCCCCCCGG CCTGCTYTCT TGAACAGTGC CGAAAACTTC ACTGTGCTCA 660

TCAAGAACAA TATCGACTTC CCCGGCCACA ACTACACCAC GAGAAACATC CTGCCAGGTT 720 

TAAACATCAC TTGTACCTTC CACAAGACTC AGAATCCACA GTGTCCCATT TTCCGACTAG 780 

GAGACATCTT CCGAGAAACA GGCGATAATT TTTCAGATGT GGCAATTCAG GGCGGAATAA 840

TGGGCATTGA GATCTACTGG GACTGCAACC TAGACCGTTG GTTCCATCAC TGCCATCCCA 900 

AATACAGTTT CCGTCGCCTT GACGACAAGA CCACCAACGT GTCCTTGTAC CCTGGCTACA 960 

ACTTCAGATA CGCCAAGTAC TACAAGGAAA ACAATGTTGA GAAACGGACT CTGATAAAAG 1020 

TCTTCGGGAT CCGTTTTGAC ATCCTGGTTT TTCGCACCGG AGGAAAATTT GACATTATCC 1080 

AGCTGGTTGT GTACATCGGC TCAACCCTCT CCTACTTCGG TCTGGCCGCT GTCTTCATCG 1140

ACTTCCTCAT CGACACTTAC TCCAGTAACT GCTGTCGCTC CCATATTTAT CCCTGGTGCA 1200 

AGTGCTGTCA GCCCTGTGTG GTCAACGAAT ACTACTACAG GAAGAAGTGC GAGTCCATTG 1260

TGGAGCCAAA GCCGACATTA AAGTATGTGT CCTTTGTGGA TGAATCCCAC ATTAGGATGG 1320 

TGAACCAGCA GCTACTAGGG AGAAGTCTGC AAGATGTCAA GGGCCAAGAA GTCCCAAGAC 1380 

CTGCGATGGA CTTCACAGAT TTGTCCAGGC TGCCCCTCGC CCTCCATGAC ACACCCCCGA 1440 

TTCCTGGACA ACCAGAGGAG ATACAGCTGC TTAGAAAGGA GGCGACTCCT AGATCCAGGG 1500 

ATAGCCCCGT CTGGTGCCAG TGTGGAAGCT GCCTCCCATC TCAACTCCCT GAGAGCCACA 1560

GGTGCCTGGA GGAGCTGTGC TGCCGGAAAA AGCCGGGGGC CTCCATCACC ACCTCAGAGC 1620 

TGTTCAGGAA GCTGGTCCTG TCCAGACACG TCCTCCAGTT CCTCCTGCTC TACCAGGAGC 1680 

CCTTGCTGGC GCTGGATGTG GATTCCACCA ACAGCCGGCT GCGGCACTCT GCCTACAGGT 1740 

GCTACGCCAC CTGGCGCTTC GGCTCCCAGG ACATCGCTGA CTTTGCCATC CTGCCCAGCT 1800 

GCTGCCGCTG GAGGATCCGG AAAGAGTTTC CGAAGAGTGA AGGGCAGTAC AGTCGCTTCA 1860 

AGAGTCCTTA CTGAAGCCAG GCACCGTGGC TMACGTCTGT AATCCCAGCG CTTTGGGAGG 1920 

CCGAGGCAGG CAGATCACCT GAGGTCGGGA GTTGGAGACC CGCCTGGCTA ACAAGGCGAA 1980 

ATCCTGTCTG TACTAAAAAT ACAAAAATCA GCCAGACATG GTGGCATGCA CCTCCAATCC 2040 






WO 98/17687



PCT/US97/19590 



CAGCTACTCG GGAGGCTGAG GCACAAGAAT CACTTGAACC CGGGAGGCAG AGGTTGTAGT 2100 

GAGCCCAGAT TGTGCCACTG CTYTCCAGCC TGGGAGGCAC AGCAAACTGT CCCCAAAAAA 2160 

AAAAAAAAA 2169 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 595 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Pro Ala Cys Cys Ser Cys Ser Asp Val Phe Gln Tyr Glu Thr Asn
1 5 10 15

Lys Val Thr Arg Ile Gln Ser Met Asn Tyr Gly Thr Ile Lys Trp Phe
20 25 30

Phe His Val Ile Ile Phe Ser Tyr Val Cys Phe Ala Leu Val Ser Asp
35 40 45

Lys Leu Tyr Gln Arg Lys Glu Pro Val Ile Ser Ser Val His Thr Lys
50 55 60

Val Lys Gly Ile Ala Glu Val Lys Glu Glu Ile Val Glu Asn Gly Val
65 70 75 80

Lys Lys Leu Val His Ser Val Phe Asp Thr Ala Asp Tyr Thr Phe Pro
85 90 95

Leu Gln Gly Asn Ser Phe Phe Val Met Thr Asn Phe Leu Lys Thr Glu
100 105 110

Gly Gln Glu Gln Arg Leu Cys Pro Glu Tyr Pro Thr Arg Arg Thr Xaa
115 120 125

Cys Ser Ser Asp Arg Gly Xaa Lys Lys Gly Trp Met Asp Pro Gln Ser
130 135 140

Lys Gly Ile Gln Thr Gly Arg Cys Val Val His Glu Gly Asn Gln Lys
145 150 155 160

Thr Cys Glu Val Xaa Ala Trp Xaa Pro Xaa Glu Ala Val Glu Glu Ala
165 170 175

Pro Arg Pro Ala Xaa Leu Asn Ser Ala Glu Asn Phe Thr Val Leu Ile
180 185 190

Lys Asn Asn Ile Asp Phe Pro Gly His Asn Tyr Thr Thr Arg Asn Ile
195 200 205

Leu Pro Gly Leu Asn Ile Thr Cys Thr Phe His Lys Thr Gln Asn Pro
210 215 220

Gln Cys Pro Ile Phe Arg Leu Gly Asp Ile Phe Arg Glu Thr Gly Asp
225 230 235 240

Asn Phe Ser Asp Val Ala Ile Gln Gly Gly Ile Met Gly Ile Glu Ile
245 250 255

Tyr Trp Asp Cys Asn Leu Asp Arg Trp Phe His His Cys His Pro Lys
260 265 270

Tyr Ser Phe Arg Arg Leu Asp Asp Lys Thr Thr Asn Val Ser Leu Tyr
275 280 285

Pro Gly Tyr Asn Phe Arg Tyr Ala Lys Tyr Tyr Lys Glu Asn Asn Val
290 295 300

Glu Lys Arg Thr Leu Ile Lys Val Phe Gly Ile Arg Phe Asp Ile Leu
305 310 315 320

Val Phe Gly Thr Gly Gly Lys Phe Asp Ile Ile Gln Leu Val Val Tyr
325 330 335

Ile Gly Ser Thr Leu Ser Tyr Phe Gly Leu Ala Ala Val Phe Ile Asp
340 345 350

Phe Leu Ile Asp Thr Tyr Ser Ser Asn Cys Cys Arg Ser His Ile Tyr
355 360 365

Pro Trp Cys Lys Cys Cys Gln Pro Cys Val Val Asn Glu Tyr Tyr Tyr
370 375 380

Arg Lys Lys Cys Glu Ser Ile Val Glu Pro Lys Pro Thr Leu Lys Tyr
385 390 395 400

Val Ser Phe Val Asp Glu Ser His Ile Arg Met Val Asn Gln Gln Leu
405 410 415

Leu Gly Arg Ser Leu Gln Asp Val Lys Gly Gln Glu Val Pro Arg Pro
420 425 430

Ala Met Asp Phe Thr Asp Leu Ser Arg Leu Pro Leu Ala Leu His Asp
435 440 445

Thr Pro Pro Ile Pro Gly Gln Pro Glu Glu Ile Gln Leu Leu Arg Lys
450 455 460

Glu Ala Thr Pro Arg Ser Arg Asp Ser Pro Val Trp Cys Gln Cys Gly
465 470 475 480

Ser Cys Leu Pro Ser Gln Leu Pro Glu Ser His Arg Cys Leu Glu Glu
485 490 495

Leu Cys Cys Arg Lys Lys Pro Gly Ala Cys Ile Thr Thr Ser Glu Leu
500 505 510

Phe Arg Lys Leu Val Leu Ser Arg His Val Leu Gln Phe Leu Leu Leu
515 520 525

Tyr Gln Glu Pro Leu Leu Ala Leu Asp Val Asp Ser Thr Asn Ser Arg
530 535 540

Leu Arg His Cys Ala Tyr Arg Cys Tyr Ala Thr Trp Arg Phe Gly Ser
545 550 555 560

Gln Asp Met Ala Asp Phe Ala Ile Leu Pro Ser Cys Cys Arg Trp Arg
565 570 575

Ile Arg Lys Glu Phe Pro Lys Ser Glu Gly Gln Tyr Ser Gly Phe Lys
580 585 590

Ser Pro Tyr 
595 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
GNGTGAAGTCT TCCTGGGAAC CATAATCT 29 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TNTTCCCTGAA GAGCTGGAGA GGTGCTAA 29
(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GNTCTACCATG TGAAGAAGGA ACGCAAAA 29
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TNGGCAAAGCT GTGCTGCAGG ATAGAGTG 29
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
ANGTGAAGGAC TGTAAGCCCA CACAAGCT 29
(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TNATGACAGCC CAGGCTGTGT TATCTGCA 29 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CNGGGCCTTCG TTTTCACTAC CTTCAGAA 29 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TNAAGACGGTG ACTGAGAGGC AGACAAGG 29 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TNCCTAGTCCC TTTAGGTAGC TCTGCAGA 29
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GNGCACGAGAA AGAAGGCGAG GATGGCTA 29
(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
TNGGACACTGT GGATTCTGAG TCTTGTGG 29

What is claimed is:

1. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:1;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:1 from nucleotide 437 to nucleotide 1159;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:1 from nucleotide 515 to nucleotide 1159;

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:1 from nucleotide 539 to nucleotide 1099;

(e) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone AR415_4 deposited under accession
number ATCC 98232;

(f) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone AR415_4 deposited under accession number ATCC 98232;

(g) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone AR415_4 deposited under accession number
ATCC 98232;

(h) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone AR415_4 deposited under accession number ATCC 98232;

(i) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:2;

(j) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:2 having biological activity;

(k) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(h) above;

(l) a polynucleotide which encodes a species homologue of the protein
of (i) or (j) above; and

(m) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(j).
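
Several limbs of the claim recite a stretch of SEQ ID NO:1 by nucleotide position, for example "from nucleotide 437 to nucleotide 1159". Assuming such ranges are 1-based and inclusive of both endpoints (the usual reading of sequence listing coordinates, though the claim does not say so explicitly), a subsequence can be extracted as sketched below in Python. The function name subsequence and the toy sequence are illustrative; SEQ ID NO:1 itself is not reproduced here.

```python
def subsequence(seq, start, end):
    """Return residues start..end of seq, 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("range falls outside the sequence")
    return seq[start - 1:end]

toy = "ACGTACGTAC"             # stand-in sequence, not SEQ ID NO:1
print(subsequence(toy, 3, 6))  # GTAC

# Under this reading, nucleotides 437 to 1159 span 1159 - 437 + 1 = 723 bases,
# i.e. 241 codons.
print((1159 - 437 + 1) // 3)   # 241
```
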

2. A composition of claim 1 wherein said polynucleotide is operably linked
to an expression control sequence.

3. A host cell transformed with a composition of claim 2.



4. The host cell of claim 3, wherein said cell is a mammalian cell. 

5. A process for producing a protein, which comprises: 

(a) growing a culture of the host cell of claim 3 in a suitable culture 
medium; and 

(b) purifying the protein from the culture. 

6. A protein produced according to the process of claim 5. 

7. The protein of claim 6 comprising a mature protein. 

8. A composition comprising a protein, wherein said protein comprises an
amino acid sequence selected from the group consisting of:

(a) the amino acid sequence of SEQ ID NO:2;

(b) the amino acid sequence of SEQ ID NO:2 from amino acid 51 to
amino acid 221;

(c) fragments of the amino acid sequence of SEQ ID NO:2; and

(d) the amino acid sequence encoded by the cDNA insert of clone
AR415_4 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins.

9. The composition of claim 8, wherein said protein comprises the amino acid
sequence of SEQ ID NO:2.

10. The composition of claim 8, wherein said protein comprises the amino acid
sequence of SEQ ID NO:2 from amino acid 51 to amino acid 221.

11. The composition of claim 8, further comprising a pharmaceutically
acceptable carrier.

12. A method for preventing, treating or ameliorating a medical condition
which comprises administering to a mammalian subject a therapeutically effective amount
of a composition of claim 11.

13. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:1.

14. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:3;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:3 from nucleotide 59 to nucleotide 376;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:3 from nucleotide 179 to nucleotide 376;

(d) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone AS63_29 deposited under accession
number ATCC 98232;

(e) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone AS63_29 deposited under accession number ATCC 98232;

(f) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone AS63_29 deposited under accession number
ATCC 98232;

(g) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone AS63_29 deposited under accession number ATCC 98232;

(h) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:4;

(i) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:4 having biological activity;

(j) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(g) above;

(k) a polynucleotide which encodes a species homologue of the protein
of (h) or (i) above; and

(l) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(i).

15. A composition comprising a protein, wherein said protein comprises an
amino acid sequence selected from the group consisting of:

(a) the amino acid sequence of SEQ ID NO:4;

(b) the amino acid sequence of SEQ ID NO:4 from amino acid 1 to
amino acid 91;

(c) fragments of the amino acid sequence of SEQ ID NO:4; and

(d) the amino acid sequence encoded by the cDNA insert of clone
AS63_29 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins.

16. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:3 or
SEQ ID NO:5.

17. A composition comprising an isolated polynucleotide selected from the
group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:6;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:6 from nucleotide 198 to nucleotide 2039;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:6 from nucleotide 490 to nucleotide 809;

(d) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone AY304_14 deposited under accession
number ATCC xxxxx;

(e) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone AY304_14 deposited under accession number ATCC xxxxx;

(f) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone AY304_14 deposited under accession number
ATCC xxxxx;

(g) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone AY304_14 deposited under accession number ATCC xxxxx;

(h) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:7;

(i) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:7 having biological activity;

(j) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(g) above;

(k) a polynucleotide which encodes a species homologue of the protein
of (h) or (i) above; and

(l) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(i).

18. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:7; 

(b) the amino acid sequence of SEQ ID NO:7 from amino acid 126 to
amino acid 204;

(c) the amino acid sequence of SEQ ID NO:7 from amino acid 106 to
amino acid 204;

(d) fragments of the amino acid sequence of SEQ ID NO:7; and

(e) the amino acid sequence encoded by the cDNA insert of clone
AY304_14 deposited under accession number ATCC xxxxx;

the protein being substantially free from other mammalian proteins. 

19. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:6. 

20. A composition comprising an isolated polynucleotide selected from the
group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:8;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:8 from nucleotide 102 to nucleotide 2027;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:8 from nucleotide 1902 to nucleotide 2027;

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:8 from nucleotide 1 to nucleotide 431;






(e) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone BG160_1 deposited under accession
number ATCC 98232;

(f) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone BG160_1 deposited under accession number ATCC 98232;

(g) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone BG160_1 deposited under accession number
ATCC 98232;

(h) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone BG160_1 deposited under accession number ATCC 98232;

(i) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:9;

(j) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:9 having biological activity;

(k) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(h) above;

(l) a polynucleotide which encodes a species homologue of the protein
of (i) or (j) above; and

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

21. A composition comprising a protein, wherein said protein comprises an
amino acid sequence selected from the group consisting of:

(a) the amino acid sequence of SEQ ID NO:9;

(b) the amino acid sequence of SEQ ID NO:9 from amino acid 1 to
amino acid 110;

(c) fragments of the amino acid sequence of SEQ ID NO:9; and

(d) the amino acid sequence encoded by the cDNA insert of clone 
BG160_1 deposited under accession number ATCC 98232; 

the protein being substantially free from other mammalian proteins. 

22. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:8.

23. A composition comprising an isolated polynucleotide selected from the
group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:11;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:11 from nucleotide 566 to nucleotide 631;

(c) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone BO432_4 deposited under accession
number ATCC 98232;

(d) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone BO432_4 deposited under accession number ATCC 98232;

(e) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone BO432_4 deposited under accession number
ATCC 98232;

(f) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone BO432_4 deposited under accession number ATCC 98232;

(g) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:12;

(h) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:12 having biological activity;

(i) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(f) above;

(j) a polynucleotide which encodes a species homologue of the protein
of (g) or (h) above; and

(k) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(h).

24. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:12; 

(b) fragments of the amino acid sequence of SEQ ID NO:12; and 

(c) the amino acid sequence encoded by the cDNA insert of clone
BO432_4 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins.

25. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:11,
SEQ ID NO:10 or SEQ ID NO:13.



26. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:14;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:14 from nucleotide 45 to nucleotide 428;

(c) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone BO538_2 deposited under accession
number ATCC 98232;

(d) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone BO538_2 deposited under accession number ATCC 98232;

(e) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone BO538_2 deposited under accession number
ATCC 98232;

(f) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone BO538_2 deposited under accession number ATCC 98232;

(g) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:15; 

(h) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:15 having biological activity; 

(i) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(f) above; 

(j) a polynucleotide which encodes a species homologue of the protein 
of (g) or (h) above ; and 

(k) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(h). 

27. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:15;

(b) the amino acid sequence of SEQ ID NO:15 from amino acid 52 to
amino acid 128;

(c) fragments of the amino acid sequence of SEQ ID NO:15; and

(d) the amino acid sequence encoded by the cDNA insert of clone
BO538_2 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins.

28. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:14
or SEQ ID NO:16.

29. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:17;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:17 from nucleotide 144 to nucleotide 566;

(c) a polynucleotide comprising the nucleotide sequence of the full-length
protein coding sequence of clone BR595_4 deposited under accession
number ATCC 98232;

(d) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone BR595_4 deposited under accession number ATCC 98232;

(e) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone BR595_4 deposited under accession number
ATCC 98232;

(f) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone BR595_4 deposited under accession number ATCC 98232;

(g) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:18;

(h) a polynucleotide encoding a protein comprising a fragment of the
amino acid sequence of SEQ ID NO:18 having biological activity;

(i) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(f) above;

(j) a polynucleotide which encodes a species homologue of the protein
of (g) or (h) above; and

(k) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(h).

30. A composition comprising a protein, wherein said protein comprises an
amino acid sequence selected from the group consisting of:

(a) the amino acid sequence of SEQ ID NO:18;

(b) the amino acid sequence of SEQ ID NO:18 from amino acid 39 to
amino acid 141;

(c) fragments of the amino acid sequence of SEQ ID NO:18; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
BR595_4 deposited under accession number ATCC 98232; 

the protein being substantially free from other mammalian proteins. 

31. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:17
or SEQ ID NO:19. 



32. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:20;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:20 from nucleotide 232 to nucleotide 1041;

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:20 from nucleotide 460 to nucleotide 1041;

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:20 from nucleotide 590 to nucleotide 1163;

(e) a polynucleotide comprising the nucleotide sequence of the
full-length protein coding sequence of clone CI490_2 deposited under accession
number ATCC 98232;

(f) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone CI490_2 deposited under accession number ATCC 98232; 

(g) a polynucleotide comprising the nucleotide sequence of the mature
protein coding sequence of clone CI490_2 deposited under accession number
ATCC 98232;

(h) a polynucleotide encoding the mature protein encoded by the
cDNA insert of clone CI490_2 deposited under accession number ATCC 98232;

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:21;

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:21 having biological activity;

(k) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(h) above; 

(l) a polynucleotide which encodes a species homologue of the protein
of (i) or (j) above; and

(m) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(j).

33. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of:

(a) the amino acid sequence of SEQ ID NO:21; 

(b) the amino acid sequence of SEQ ID NO:21 from amino acid 133 to
amino acid 270; 

(c) fragments of the amino acid sequence of SEQ ID NO:21; and

(d) the amino acid sequence encoded by the cDNA insert of clone
CI490_2 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins. 

34. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:20. 

35. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:22;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:22 from nucleotide 268 to nucleotide 624; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:22 from nucleotide 325 to nucleotide 624;

(d) a polynucleotide comprising the nucleotide sequence of the
full-length protein coding sequence of clone CI522_1 deposited under accession
number ATCC 98232;

(e) a polynucleotide encoding the full-length protein encoded by the
cDNA insert of clone CI522_1 deposited under accession number ATCC 98232;

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CI522_1 deposited under accession number
ATCC 98232; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CI522_1 deposited under accession number ATCC 98232;

(h) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:23; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:23 having biological activity;

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein
of (h) or (i) above; and

(l) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(i).

36. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of:

(a) the amino acid sequence of SEQ ID NO:23;

(b) fragments of the amino acid sequence of SEQ ID NO:23; and 

(c) the amino acid sequence encoded by the cDNA insert of clone
CI522_1 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins. 

37. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:22 
or SEQ ID NO:24.

38. A composition comprising an isolated polynucleotide selected from the
group consisting of:

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:25;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:25 from nucleotide 288 to nucleotide 713; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:25 from nucleotide 686 to nucleotide 968; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CN238_1 deposited under accession 
number ATCC 98232; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CN238_1 deposited under accession number ATCC 98232; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CN238_1 deposited under accession number 
ATCC 98232; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CN238_1 deposited under accession number ATCC 98232;

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:26; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:26 having biological activity;

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein
of (h) or (i) above; and

(l) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(i). 

39. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:26; 

(b) fragments of the amino acid sequence of SEQ ID NO:26; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
CN238_1 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins.

40. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:25.

41. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID
NO:27;

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:27 from nucleotide 87 to nucleotide 1874; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:27 from nucleotide 452 to nucleotide 830;

(d) a polynucleotide comprising the nucleotide sequence of the
full-length protein coding sequence of clone CO390_1 deposited under accession
number ATCC 98232;

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CO390_1 deposited under accession number ATCC 98232;

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CO390_1 deposited under accession number
ATCC 98232; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CO390_1 deposited under accession number ATCC 98232;

(h) a polynucleotide encoding a protein comprising the amino acid
sequence of SEQ ID NO:28; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:28 having biological activity;

(j) a polynucleotide which is an allelic variant of a polynucleotide of
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above; and

(l) a polynucleotide capable of hybridizing under stringent conditions
to any one of the polynucleotides specified in (a)-(i). 

42. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:28;

(b) the amino acid sequence of SEQ ID NO:28 from amino acid 140 to
amino acid 248;

(c) fragments of the amino acid sequence of SEQ ID NO:28; and

(d) the amino acid sequence encoded by the cDNA insert of clone
CO390_1 deposited under accession number ATCC 98232;

the protein being substantially free from other mammalian proteins.

43. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:27.

FIGURE 1A

[Plasmid map; labeled restriction sites: BamHI 2772, ClaI 2666]

Plasmid name: pED6dpc2
Plasmid size: 5374 bp

Comments/References: [illegible in source] by insertion of a new
polylinker to facilitate cDNA cloning. SST cDNAs are cloned between EcoRI and NotI.
pED vectors are described in Kaufman et al. (1991), NAR 19: 4485-4490.

FIGURE 1B

[Plasmid map; labeled restriction sites: HindIII 1, SapI 2135]

Plasmid name: pNOTs
Plasmid size: 4529 bp

Comments/References: pNOTs is a derivative of pMT2 (Kaufman et al., 1989, Mol. Cell. Biol. 9: 1741-1750).
DHFR was deleted and a new polylinker was inserted between EcoRI and HpaI. M13 origin
of replication was inserted in the ClaI site. SST cDNAs are cloned between EcoRI and
NotI.